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MURINE AND HUMAN BOX— DEPENDENT MYC— INTERACTING PROTEIN 

(BIN1) AND USES THEREFOR 

This invention was made with financial assistance 
from the National Institutes of Health Grant No. 5-P3 0- 
5 CA-10815-28. The United states government has certain 
rights in this invention. 

Field of the Invention 

This invention relates generally to cancer diagnosis 
and therapy, and more specifically, to cancers associated 
10 with the Myc oncoprotein. 

Background of the Invention 

Myc is a transcription factor and key cell growth 
regulator that is frequently deregulated in human 
malignancy, notably Burkitt's and T cell lymphomas, where 

15 myc genes suffer chromosomal translocation. In colon and 
lung carcinomas, myc genes are amplified [M.D. Cole, Ann. 
Rev. Genet. , 20:361-384 (1986)]. Paradoxically, under 
certain conditions myc can induce apoptosis, a regulated 
cell suicide process [D.S. Askew et al , Oncogene . 6:1915- 

20 1922 (1991); G.I. Evan et al , Cell , 69:119-128 (1992)]. 

However, loss or suppression of apoptosis is an important 
step in the malignant conversion of human tumors 
containing deregulated myc oncogenes, including, 
prominently, prostate carcinoma [T. G. Strohmeyer et al, 

25 J. Urol. . 151:1479-1497 (1994)]. 

There remains a need in the art for compositions and 
methods of regulating a deregulated Myc protein and of 
exploiting and/or diagnosing its apoptotic potential. 

Brief Description of the Drawings 
30 Fig. 1 is a partial ' murine cDNA sequence SEQ ID N0:1 

and the murine Bin! polypeptide encoded thereby SEQ ID 
NO: 2. 
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Fig- 2A-2C is a human cDNA sequence SEQ ID NO: 3 and 
the human Binl polypeptide encoded thereby SEQ ID NO: 4. 

Fig. 3A is a bar chart illustrating the selective 
requirement of the Myc-binding domain (MBD) for Myc 
5 inhibition, as described in Example 7. The data 

represent three to seven trials for each transfection. 
The data are depicted as the percent of foci induced by 
oncogenes and vector, as appropriate. 

Fig. 3B is a bar chart illustrating the dominant 
10 inhibitory activity of MBD. 

Fig. 4 is a bar chart illustrating that Binl vectors 
selectively inhibit colony formation in HepG2 cells 
lacking endogenous expression. The data are depicted as 
the percentage of colonies obtained with empty vector. 

15 summary t.he Invention 

in one aspect, the present invention provides a 
partial murine cDNA clone of a Box-dependent myc- 
interacting polypeptide 1 (Binl) , formerly referred to as 
c-Myc interacting peptide (MIP or MIP-99), SEQ ID NO-.l, 
20 and the polypeptide encoded thereby, SEQ ID N0:2. 

In another aspect, the present invention provides a 
human Binl cDNA clone, SEQ ID NO: 3, and the human 
polypeptide encoded thereby, SEQ ID NO: 4. 

in yet another aspect, the present invention 
25 provides a vector comprising a mammalian nucleic acid 
sequence encoding a Binl protein and a host cell 
transformed by such a vector. Alternatively, this vector 
may be used in gene therapy applications. 

In still another aspect, the invention provides an 
30 oligonucleotide probe comprising a nucleic acid sequence 
as defined herein. Also provided is an antibody raised 
against a Binl protein or peptide thereof. 
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In yet a further aspect, the present invention 
provides a diagnostic reagent for breast or liver cancer, 
or deficient Binl production, comprising an 
oligonucleotide probe or an antibody of the invention. 
5 Further provided is a therapeutic reagent comprising 

a polypeptide, anti-idiotype antibody, or gene therapy 
vector of the invention. 

Still another aspect of the invention provides a 
method of treating breast or liver cancer by 
10 administering a therapeutic reagent of the invention. 
Other aspects and advantages of the present 
invention are described further in the following detailed 
description of the preferred embodiments thereof. 

Detailed Description of the Invention 

15 The present invention provides novel, isolated, 

nucleic acid sequences which encode novel proteins which 
interact with c-Myc and bind thereto, fragments of these 
sequences and antibodies developed thereto. The nucleic 
acid sequences, protein sequences and antibodies are 

2 0 useful in the detection, diagnosis and treatment of 

cancers or other disorders associated with deregulation, 
deficiency or amplification of the c- mvc oncogenes. 
Further, when a Box-dependent myc-interacting polypeptide 
1 (called Binl) of this invention binds to c-Myc, the 

2 5 binding appears to regulate the c-Myc and result in tumor 
suppression, by inhibiting cell growth and/or 
facilitating apoptosis (programmed cell death) . The Binl 
gene has several other features suggesting it is a tumor 
suppressor gene. First, Binl inhibits Myc-dependent 

JO malignant cell transformation. Second, Binl is 

structurally related to RVS167, a negative regulation of 
the cell division cycle in the yeast Saccharomvces 
cerevisiae [F. Bauer et al, Mol. Cell. Biol. . 13.:5070- 
5084 (1993)]. Third, Southern analysis of the Binl gene 
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reveals that it is mutated in a significant portion of 
human liver carcinoma cell lines. Fourth, Northern 
analysis indicates that expression of Binl RNA is lost in 
human liver and breast carcinoma cell lines. Fifth, 
5 chromosomal mapping has identified Binl's location at 

2ql4, a frequent site of deletion in metastatic prostate 
cancers [W. Isaacs, Johns Hopkins Medical School, 
personal communication] and radiation-induced leukemias 
[I. Hayata et al, r^er Res. . 43:367-373 (1983)]. All 
10 of these features support the assignment of Binl as a 

tumor suppressor gene, similar to the breast cancer gene 
BRCAl, and the genes encoding p53 and the Rb 
retinoblastoma protein, which are negative regulators of 
cell growth that are observed to be mutated and/or 
15 unexpressed in human cancer cells. These aspects of the 
invention are discussed in more detail below. 

I. Nucleic Acid Se quences 

The present invention provides mammalian nucleic 
acid sequences encoding a Box-dependent myc-interacting 
20 polypeptide 1, termed herein Binl. The nucleic acid 

sequences of this invention are isolated from cellular 
materials with which they are naturally associated. In 
one embodiment, a Binl nucleic acid sequence is selected 
from all or part of the partial murine cDNA clone, SEQ ID 
25 NO: 1. In another embodiment, a Binl nucleic acid 

sequence is selected from all or part of a human cDNA 
clone, SEQ ID NO: 3. In yet another embodiment, the 
present invention provides a partial Binl genomic 
sequence, SEQ ID NO: 6. However, the present invention 
3 0 is not limited to these nucleic acid sequences. 

Given the sequences of SEQ ID NO: 1, SEQ ID NO: 3, 
and SEQ ID NO: 6, one of skill in the art can readily 
obtain the corresponding anti-sense strands of these cDNA 
and genomic sequences. Further, using known techniques, 
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one of skill in the art can readily obtain further 
genomic sequences corresponding to these cDNA sequences 
or the corresponding RNA sequences, as desired. 

Similarly the availability of SEQ ID NOS : 1, 3 and 6 
5 of this invention permits one of skill in the art to 

obtain other species Binl analogs, by use of the nucleic 
acid sequences of this invention as probes in a 
conventional technique, e.g. , polymerase chain reaction. 
Allelic variants of these sequences within a species 

10 (i.e., nucleotide sequences containing some individual 
nucleotide differences from a more commonly occurring 
sequence within a species, but which nevertheless encode 
the same protein) such as other human variants of Binl 
SEQ ID NO: 3, may also be readily obtained given the 

15 knowledge of this sequence provided by this invention. 

The present invention further encompasses nucleic 
acid sequences capable of hybridizing under stringent 
"conditions [see, J. Sambrook et al , Molecular Cloning; A 
Laboratory Manual , 2d ed. , Cold Spring Harbor Laboratory 

20 (1989)] to the sequences of SEQ ID NO: 1, SEQ ID NO: 3, 
SEQ ID NO: 6, their anti-sense strands, or biologically 
active fragments thereof. An example of a highly 
stringent hybridization condition is hybridization at 
2XSSC at 65 °C, followed by a washing in 0.1XSSC at 65 °C 

25 for an hour. Alternatively, an exemplary highly 

stringent hybridization condition is in 50% formamide, 
4'XSSC at 42 °C. Other, moderately high stringency 
conditions may also prove useful, e.g. hybridization in 
4XSSC at 55° C, followed by washing in 0.1XSSC at 37°C for 

30 an hour. Alternatively, an exemplary moderately high 

stringency hybridization condition is in 50% formamide, 
4XSSC at 30°C. 

Also encompassed within this invention are fragments 
of the above-identified nucleic acid sequences. 

35 Preferably, such fragments are characterized by encoding 
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a biologically active portion of Binl, e.g., an epitope. 
Generally, these oligonucleotide fragments are at least 
15 nucleotides in length. However, oligonucleotide 
fragments of varying sizes may be selected as desired. 
5 Such fragments may be used for such purposes as 

performing the PCR, e.g., on a biopsied tissue sample. 
For example, one fragment which is anticipated to be 
particularly useful is the Src homology 3 (SH3) domain, 
which is located at about nucleotides 891-1412 of SEQ ID 
10 NO: 3 (which encode amino acid residues 278-451 of SEQ ID 
NO: 4) . Preliminary data has indicated this domain may 
be useful in blocking apoptosis. Other useful fragments 
include about nucleotides 813-854 of SEQ ID NO: 3 
(encoding a nuclear localization signal, amino acid 
15 residues about 252-265 of SEQ ID NO: 4) , nucleotides 

about 867-908 (a Myc-binding domain or MBD amino acids 
270-283) . Other fragments and other uses of such 
fragments are discussed in more detail below. 

The nucleotide sequences of the invention may be 
2 0 isolated by conventional uses of polymerase chain 

reaction or cloning techniques such as those described in 
obtaining the murine and human sequences, described 
below. Alternatively, these sequences may be constructed 
using conventional genetic engineering or chemical 
25 synthesis techniques. 

According to the invention, the nucleic acid 
sequences [SEQ ID NO: 1, SEQ ID NO: 3, and SEQ ID NO: 6] 
may be modified. Utilizing the sequence data in these 
figures and in the sequence listing, it is within the 
skill of the art to obtain other polynucleotide sequences 
encoding the proteins of the invention. Such ' 
modifications at the nucleic acid level include, for 
example, modifications to the nucleotide sequences which 
are silent or which change the amino acids, e.g. to 
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improve expression or secretion. Also included are 
allelic variations, caused by the natural degeneracy of 
the genetic code. 

Also encompassed by the present invention are 
5 mutants of the Binl gene provided herein. Such mutants 
include amino terminal , carboxy terminal or internal 
deletions which are useful as dominant inhibitor genes. 
Such a truncated, or deletion, mutant may be expressed 
for the purpose of inhibiting the activity of the full- 

10 length or wild-type gene. For example, it has been found 
that expression of the partial murine Binl provided 
herein [SEQ ID NO: 2] acts in a dominant inhibitory manner 
to suppress normal Binl activity. Expression of this 
protein is described in Example 4 below. Another mutant 

15 encodes Binl deleted in the region encoding the MBD 

domain (amino acid residues 270-383 of SEQ ID NO: 4) . 

The invention further provides the complete human 
Binl gene, which has been cloned as a 35-45 kb contiguous 
sequence from a lambda phage genomic library. The DNA 

20 sequence of approximately 19 Jcb (about the 3 1 half) of 
the approximately 40 kb Binl gene has been determined 
[SEQ ID NO: 6]. More detailed discussion of the Binl 
genomic sequence is provided in Example 3 . The exon- 
intron junction sequences derived are desirable for 

25 applying PCR technology to identify mutations in DNA 

derived from tumor biopsies, using techniques similar to 
those applied to sequences derived from other tumor 
suppressor genes (e.g., p53 and BRCA1) . The sequenced 
region of the Binl gene spans regions previously found to 

3 0 be rearranged in liver and cervix carcinoma cell lines, 
making it possible to map deletions and possible 
mutations in primary human tumor DNA by PCR technology. 
Using the genomic clones, the human Binl gene has been 
mapped to chromosome 2ql4, a region frequently deleted in 

35 prostate carcinoma and in radiation-induced malignancies. 
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These nucleic acid sequences are useful for 
variety of diagnostic and therapeutic uses. 
Advantageously, the nucleic acid sequences are useful in 
the development of diagnostic probes and antisense probes 
for use in the detection and diagnosis of conditions 
characterized by deregulation or amplification of c-myc. 
The nucleic acid sequences of this invention are also 
useful in the production of mammalian, and particularly, 
murine and human Binl proteins. 



10 II- Protein Sequences 

The present invention also provides mammalian Binl 
polypeptides or proteins. These proteins are free from 
association with other contaminating proteins or 
materials with which they are found in nature. In one 
15 embodiment, the invention provides a partial murine Binl 
[SEQ ID NO: 2] polypeptide of 135 amino acids having a 
predicted molecular weight (MW) of 13,688. In another 
embodiment, the invention provides a full-length human 
Binl [ SEQ ID NO: 4] of 451 amino acids with an estimated 
20 MW of 50,048. The apparent MW of human Binl on sodium 
dodecyl sulfate polyacrylamide (SDS-PA) gels is 
approximately 67 kD. 

Comparisons of the Binl amino acid sequence to the 
DNA database were performed using the search algorithm 
25 BLAST [S.F. Altschul et al , J. Mol . Biol., 215:403-410 
(1990)]. Using the complete sequence to search the 
database, two known genes were identified which had 
highly significant similarity to the terminal regions of 
Binl (p < 10 8 ) . The first gene was amphiphysin, a 
30 neuronal protein of unknown function which is the 

putative autoimmune antigen in breast cancer-associated 
Stiff -Man syndrome [F. Folli et al, W. F.nq. J. Med. , 
328 :546-551 (1993)], a paraneoplastic disorder that 
clinically presents in a fraction of breast cancer 
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patients. The second gene was RVS167, a negative 
regulator of the cell division cycle in S . cerevisiae . 
The region of the most extensive similarity between 
amphiphysin and RVS167, approximately 50% and 25% , 
5 respectively, lies within residues 1-222 of Binl [SEQ ID 
N0:4]- Therefore, this N-terminal region of Binl has 
been termed herein the BAR domain (for 

Binl/amphiphysin/RVS167-related domain) . The extensive 
similarity of the BAR domains in these proteins suggest a 

10 common molecular function. Moreover, the relationship 
suggests roles for Binl in breast malignancy, where Myc 
is frequently involved, and in cell cycle regulation. 
Finally, since RVS167 is a negative regulator which is 
dispensible for cell growth but required for cell cycle 

15 exit, the similarity to RVS167 would be consistent with 
the likelihood that Binl is a tumor suppressor. 

To gain additional insights into the molecular 
functions of Binl, additional BLAST searches were 
performed with subsections of the Binl sequence. These 

20 searches identified several gene products which all 
function in regulation of cell cycle transit and/or 
chromosomal structure. Several additional relationships 
were revealed within the Binl BAR domain. These included 
pericentrin (30% identical; 46% similar; P<0.01), a 

25 centromere-binding protein required for proper chromosome 
organization during the cell cycle M phase; mitosin (24% 
identical; 48% similar; P=0.02), a protein implicated in 
transit through M phase; and SMC1 (21% identical; 43% 
similar; P=0.05), a yeast regulator of M phase chromosome 

30 segregation. In the scoring range where these 

similarities were observed, highly alpha helical regions 
of non-muscle myosin, tropomyosin, and the trp gene 
product were also found, suggesting that the BAR domain 
shares their highly helical structure. Between the C- 

3 5 terminal end of the BAR region and the nuclear 
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localization signal (NLS; amino acids 252-265, SEQ ID NO: 
4) lies an additional Binl domain (amino acids 224-251, 
SEQ ID NO: 4), encoded by a single exon, which is not 
found in amphiphysin and RVS167 but which also contains 
motifs seen in proteins controlling cell cycle and 
chromosome structure. One -10 amino acid motif is found 
in a functionally important region of the SV40 T antigen 
oncoprotein, while a second motif is seen in RED1, a 
yeast protein implicated in chromosome segregation. 
Proximal to these motifs is an additional motif which is 
similar to p93disl, another yeast protein implicated in 
chromosome segregation. Taken together, these 
observations strengthen the likelihood that Binl 
participates in some aspect of cell cycle regulation and 
15 further suggests a role in chromosome structure control. 

Further encompassed by this invention are fragments 
of the Binl polypeptides. Such fragments are desirably 
characterized by having Binl biological activity, 
including, e.g., the ability to interact with c-Myc. 
These fragments may be designed or obtained in any 
desired length, including as small as about 8 amino acids 
in length. Such a fragment may represent an epitope of 
the protein. One particularly desirable fragment is 
located at amino acid residues 270-383 of SEQ ID NO: 4, 
which is the c- Myc binding domain (MBD) . Another 
desirable fragment is located at residues 278-451 of SEQ 
ID NO: 4 and is a Src homology 3 (SH3) domain. A third 
fragment is located at residues 223-251 of SEQ ID NO: 4 
and includes the T antigen/REDl/ P 93disl motifs discussed 
above. Yet another desirable fragment includes the BAR 
domain, located at amino acid residues 1-222 of SEQ ID 
NO: 4. Finally, a fragment containing the nuclear 
localization domain located at amino acid residues 252 to 
about 265 of SEQ ID NO: 4, may also be desirable. 
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Also included in the invention are analogs, or 
modified versions, of the proteins provided herein. 
Typically, such analogs differ by only one to four codon 
changes. Examples include polypeptides with minor amino 
5 acid variations from the illustrated amino acid sequences 
of Binl (Figs. 1 and 2; SEQ ID NO: 2 and 4) ; in 
particular, conservative amino acid replacements. 
Conservative replacements are those that take place 
within a family of amino acids that are related in their 

10 side chains and chemical properties. Also provided are 
homologs of the proteins of the invention which are 
characterized by having at least 85% homology with SEQ ID 
NO: 2 or SEQ ID NO: 4. It has previously determined that 
the murine and human Binl (in partial) are about 88.5% 

15 identical . 

Additionally, the Binl proteins [SEQ ID NO: 2 and 4] 
of the invention may be modified, for example, by 
truncation at the amino or carboxy termini , by 
elimination or substitution of one or more amino acids, 

2 0 or by any number of now conventional techniques to 

improve production thereof, to enhance protein stability 
or other characteristics, e.g. binding activity or 
bioavailability, or to confer some other desired property 
upon the protein. 

25 III. Expression 

A. In Vitro 

To produce recombinant Binl proteins of this 
invention, the DNA sequences of the invention are 
inserted into a suitable expression system. Desirably, a 

3 0 recombinant molecule or vector is constructed in which 

the polynucleotide sequence encoding Binl is operably 
linked to a heterologous expression control sequence 
permitting expression of the murine or human Binl 
protein. Numerous types of appropriate expression 
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vectors are known in the art for mammalian (including 
human) protein expression, by standard molecular biology 
techniques. Such vectors may be selected from among 
conventional vector types including insects, e.g., 
5 baculovirus expression, or yeast, fungal, bacterial or 
viral expression systems. Other appropriate expression 
vectors, of which numerous types are known in the art, 
can also be used for this purpose* 

Methods for obtaining such expression vectors 
10 are well-known. See, Sambrook et al , Molecular Cloning. 
A Laboratory Manual . 2d edition, Cold Spring Harbor 
Laboratory, New York (1989); Miller et al, Genetic 
Engineering , 8:277-298 (Plenum Press 1986) and references 
cited therein. 
15 Suitable host cells or cell lines for 

transfection by this method include mammalian cells, such 
as Human 293 cells, Chinese hamster ovary cells (CHO) , 
the monkey COS-1 cell line or murine 3T3 cells derived 
from Swiss, Balb-c or NIH mice may be used. Another 
20 suitable mammalian cell line is the CV-1 cell line. 

Still other suitable mammalian host cells, as well as 
methods for transfection, culture, amplification, 
screening, and product production and purification are 
known in the art. [See, e.g., Gething and Sambrook, 
25 Nature, 293:620-625 (1981), or alternatively, Kaufman et 
al, Mnl . Cell. Biol. , 5(7) :1750-1759 (1985) or Howley et 
al, U. S. Patent 4,419,446]. 

Similarly bacterial cells are useful as host 
cells for the present invention. For example, the 
30 various strains of E. coli (e.g., HB101, MC1061, and 

strains used in the following examples) are well-known as 
host cells in the field of biotechnology. Various 
strains of B. subtilis . Pseudomonas, other bacilli and 
the like may also be employed in this method. 
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Many strains of yeast cells known to those 
skilled in the art are also available as host cells for 
expression of the polypeptides of the present invention. 
Other fungal cells may also be employed as expression 
5 systems* 

Alternatively, insect cells such as Spodoptera 
frugjpedera (Sf9) cells may be used. 

Thus, the present invention provides a method 
for producing a recombinant Binl protein which involves 

10 transfecting a host cell with at least one expression 

vector containing a recombinant polynucleotide encoding a 
Binl protein under the control of a transcriptional 
regulatory sequence, e.g. by conventional means such as 
electroporation. The transfected host cell is then 

15 cultured under suitable conditions that allow expression 
of the Binl protein. The expressed protein is then 
recovered, isolated, and optionally purified from the 
culture medium (or from the cell, if expressed 
intracellularly) by appropriate means known to one of 

20 skill in the art. 

For example, the proteins may be isolated in 
soluble form following cell lysis, or may be extracted 
using known techniques, e.g., in guanidine chloride. If 
desired, the Binl proteins of the invention may be 

25 produced as a fusion protein. For example, it may be 
desirable to produce Binl fusion proteins, to enhance 
expression of the protein in a selected host cell, to 
improve purification, or for use in monitoring the 
presence of Binl in tissues, cells or cell extracts. 

30 Suitable fusion partners for the Binl proteins of the 

invention are well known to those of skill in the art and 
include, among others, £-galactosidase , glutathione-S- 
transf erase, and poly-histidine. 
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B. In Vivo 

Alternatively, where it is desired that the 
Binl protein be expressed in vivo, e.g., for gene therapy 
purposes, an appropriate vector for delivery of Binl, or 
5 fragment thereof (such as the SH3 domain) , may be readxly 
selected by one of skill in the art. Exemplary gene 
therapy vectors are readily available from a variety of 
academic and commercial sources, and include, e.g., 
adeno-associated virus [International patent application 
10 No PCT/US91/03440] , adenovirus vectors [M. Kay et al , 

t^. w>.tl. «»i - 91:2353 (1994); S. Ishibashi 

et al, t n-Mn. Tnvest. . 92:883 (1993)], or other viral 
vectors,, e.g., various poxviruses, vaccinia, etc. 
Methods for insertion of a desired gene, e.g. Binl, and 
obtaining in vivo expression of the encoded protein, are 
well known to those of skill in the art. 



15 



20 



IV. Antisera and Antibodies 

The Binl proteins of this invention are also useful 
as antigens for the development of anti-Binl antisera and 
antibodies to Binl or to a desired fragment of a Binl 
protein. Specific antisera may be generated using known 
techniques. See, Sambrook, cited above, Chapter 18, 
generally, incorporated by reference. Similarly, 
25 antibodies of the invention, both polyclonal and 

monoclonal, may be produced by conventional methods, 
including the Kohler and Milstein hybridoma technique, 
recombinant techniques, such as described by Huse et al , 
science , 246:1275-1281 (1988), or any other techniques 
known to the art. For example, rabbit polyclonal 
antisera was developed and recognizes an epitope (s) 
between amino acid residues 190-250 of SEQ ID NO: 4. 
This antisera has been found to be human-specific. Since 
amino acids 190-250 are outside the MBD and SH3 domain, 
for experimental needs two additional antisera have been 
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raised to these regions. The immunogens included human 
Binl amino acids 270-383 (MBD) [SEQ ID NO: 4] or amino 
acids 278-451 (SH3) [SEQ ID NO: 4]. Each antisera has 
been shown to recognize the appropriate domain by 
5 immunoprecipitation. 

Additionally, six (6) Binl-specific monoclonal 
antibodies have been characterized, termed 99-D through 
99-1. The approximate location of the epitopes within 
Binl for each antibody has been mapped. MAb 99D 
10 recognizes an epitope within amino acids 190-250 [SEQ ID 
NO: 4]; MAbs 99F-99I recognize epitopes within the NjLS 
(amino acids 252-261 [SEQ ID NO: 4]); MAb 99E recognizes 
a complex epitope requiring amino acids 190-250 and amino 
acids 263-397 [SEQ ID NO: 4]. Each antibody has been 
15 isotyped and demonstrated to work in immunoprecipitation, 
Western blotting, and immunohistochemistry methodology. 
Particularly, MAb 99D and MAb 99F are IgG2b isotypes; 
MAbs 9E, 99G and 99H are IgGl isotypes. Further, MAbs 
99D and 99F have been determined to be useful for 
2 0 immunohistochemistry with sectioned biopsy tissue and 
tissue culture cells, and are therefor likely to be 
useful for clinical applications to analyze tumor 
biopsies. 99D recognizes a nuclear protein present in 
all normal cells examined so far but missing in carcinoma 
25 cells previously demonstrated to lack Binl RNA. 99F has 
been determined to specifically recognize a cytoplasmic 
form of Binl which is induced following muscle 
differentiation in an in vitro model system which is 
described below, 99D recognizes both the cytoplasmic as 
30 well as the nuclear forms of Binl. 99D has been 

determined to be effective for detecting Binl protein by 
standard Western methodology in nonionic detergent 
lysates of a wide variety of tissues and tissue culture 
cells. 99D and 99F also have been shown to recognize 
35 both murine and human Binl polypeptides. 
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Also encompassed within this invention are humanized 
and chimeric antibodies. As used herein, a humanized 
antibody is defined as an antibody containing murine 
complementary determining regions (CDRs) capable of 
5 binding to Binl or a fragment thereof, and human 

framework regions. These CDRs are preferably derived 
from a murine monoclonal antibody (MAb) of the invention. 
As defined herein, a chimeric antibody is defined as an 
antibody containing the variable region light and heavy 
10 chains, including both CDR and framework regions, from a 
Binl MAb of the invention and the constant region light 
and heavy chains from a human antibody. Methods of 
identifying suitable human framework regions and 
modifying a MAb of the invention to contain same to 
15 produce a humanized or chimeric antibody of the 

invention, are well known to those of skill in the art. 
See, e.g., E . Mark and Padlin, "Humanization of 
Monoclonal Antibodies", Chapter 4, The. Handbook of 
FvnP.rimentai Pharmacology , Vol. 113, The Pharmacology of 
Monoclonal Antibodies, Springer-Verlag (June, 1994). 
Other types of recombinant ly-designed antibodies are also 
encompassed by this invention. 

Further provided by the present invention are anti- 
idiotype antibodies (Ab2) and anti-anti-idiotype 
antibodies (Ab3) . Ab2 are specific for the target to 
which anti-Binl antibodies of the invention bind and Ab3 
are similar to Binl antibodies (Abl) in their binding 
specificities and biological activities [see, e.g., M. 
Wettendorff et al., "Modulation of anti-tumor immunity by 
ant i- idiotypic antibodies." In Idiotypic Network and 
Diseases , ed. by J. Cerny and J. Hiernaux J, Am. Soc. 
Microbiol., Washington DC: pp. 203-229, (1990)]. These 
anti-idiotype and anti-anti idiotype antibodies may be 
produced using techniques well known to those of skill in 
the art. Such anti-idiotype antibodies (Ab2) can bear 
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the internal image of the c-Myc and bind to it in much 
the same manner as Binl and are thus useful for the same 
purposes as Binl. 

In general, polyclonal antisera, monoclonal 
5 antibodies and other antibodies which bind to Binl as the 
antigen (Abl) are useful to identify epitopes of Binl, to 
separate Binl from contaminants in living tissue (e.g., 
in chromatographic columns and the like) , and in general 
as research tools and as starting material essential for 

10 the development of other types of antibodies described 
above. Anti-idiotype antibodies (Ab2) are useful for 
binding c-Myc and thus may be used in the treatment of 
cancers in which c- Myc is part of a biochemical cascade 
of events leading to tumor formation. The Ab3 antibodies 

15 may be useful for the same reason the Abl are useful. 
Other uses as research tools and as components for 
separation of c-Myc from other contaminant of living 
tissue, for example, are also contemplated for these 
antibodies . 

2 0 V. Diagnostic Reagents and Methods 

Advantageously, the present invention provides 
reagents and methods useful in detecting and diagnosing 
abnormal levels of Binl, and particularly deficiencies or 
excess production thereof, in a patient. As defined 
25 herein, a deficiency of Binl is defined as an inadequate 
amount of Binl to compensate for the levels of c-Myc in a 
patient. Conditions associated with deficiencies of Binl 
include a variety of cancers, e.g., breast cancer, liver 
cancer and colon cancer, and hyperplastic disease states, 

3 0 e.g. , benign prostate hyperplasia. 

Thus, the proteins, protein fragments, antibodies, 
and polynucleotide sequences (including anti-sense 
polynucleotide sequences and oligonucleotide fragments) , 
and Binl antisera and antibodies of this invention may be 
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useful as diagnostic reagents. These reagents may 
optionally be labelled using diagnostic labels, such as 
radioactive labels, coloriroetric enzyme label systems and 
the like conventionally used in diagnostic or therapeutic 
5 methods. Alternatively, the N- or C-terminus of Binl or 
a fragment thereof may be tagged with a viral epitope 
which can be recognized by a specific antisera. The 
reagents may be used to measure abnormal Binl levels in 
selected mammalian tissue in conventional diagnostic 
10 assays, e.g., Southern blotting, Northern and Western 
blotting, polymerase chain reaction (PCR) , reverse 
transcriptase (RT) PCR, immunostaining, and the like. 
For example, in biopsies of tumor tissue, loss of Binl 
expression in tumor tissue could be directly verified by 
15 RT-PCR or immunostaining. Alternatively, a Southern 
analysis, genomic PCR, or fluorescence in situ 
hybridization (FISH) may be performed to confirm Binl 
gene rearrangement . 

In one example, as diagnostic agents the 
20 polynucleotide sequences may be employed to detect or 

quantitate normal Binl. The selection of the appropriate 
assay format and label system is within the skill of the 
art and may readily be chosen without requiring 
additional explanation by resort to the wealth of art in 
25 the diagnostic area. 

Thus the present invention provides methods for the 
detection of disorders characterized by insufficient Binl 
levels. Currently, it is anticipated that antibodies of 
the invention, such as 99D and 99F, which have been found 
30 to be able to withstand the conditions necessary for 
tissue fixation, will be particularly useful for 
biopsies. However, the protein, antibody, antisera or 
polynucleotide reagents of the invention are expected to 
be similarly useful in the following methods. The 
3 5 methods involve contacting a selected mammalian tissue, 
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e.g., a biopsy sample or other cells, with the selected 
reagent, protein, antisera antibody or DNA sequence, and 
measuring or detecting the amount of Binl present in the 
tissue in a selected assay format based on the binding or 
5 hybridization of the reagent to the tissue. 

VI . Therapeutic Compositions and Methods 

Compositions and methods useful for the treatment of 
conditions associated with inadequate Binl levels are 
provided. As stated above, included among such 

10 conditions are liver, colon and breast cancers and 
hyperplastic disease states. Also provided are 
compositions and methods for inhibiting Binl activity in 
order to ameliorate a condition in which apoptosis is 
activated and Binl plays a role. Such conditions may 

15 include degenerative conditions, e.g., neurodegenerative 
diseases. 

The therapeutic compositions of the invention may be 
formulated to contain an anti-idiotype antibody of the 
invention, or the Binl protein itself or a fragment 

20 thereof may be administered to mimic the effect of normal 
Binl and bind c-Myc, thereby preventing its cancer 
causing function. For example, one particularly useful 
protein may be the Binl SH3 domain (amino acids 378-451 
of SEQ ID NO: 4) . These compositions may contain a 

25 pharmaceutically acceptable carrier. Suitable carriers 
are well known to those of skill in the art and include, 
for example, saline. Alternatively, such compositions 
may include conventional del ivery systems into which 
protein of the invention is incorporated. Optionally, 

30 these compositions may contain other active ingredients, 
e.g. , chemotherapeutics. 

Still another method involves the use of the Binl 
polynucleotide sequences for gene therapy. In the 
method, the Binl sequences are introduced into a suitable 
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vector for delivery to a cell containing a deficiency of 
Binl and/or to block tumor growth* By conventional 
genetic engineering techniques, the Binl gene sequence 
may be introduced to mutate the existing gene by 
5 recombination or to replace an inactive or missing gene. 

The dose, timing and mode of administration of these 
compositions may be determined by one of skill in the 
art. Such factors as the age, condition, and the level 
of the Binl deficiency detected by the diagnostic methods 
10 described above, may be taken into account in determining 
the dose, timing and mode of administration of the 
therapeutic compositions of the invention. Generally, 
where treatment of an existing cancer or hyperplastic 
state is indicated, a therapeutic composition of the 
15 invention is preferably administered in a site-directed 
manner and is repeated as needed. Such therapy may be 
administered in conjunction with conventional therapies, 
including radiation and/or chemotherapeutic treatments. 

The following examples illustrate the isolation and 
2 0 use of the Binl sequences of the invention. These 

examples are illustrative only and do not limit the scope 
of the invention. 

Example 1 - Identification and Characterization of Binl 
A. Murine Binl cDNA 

25 a yeast two hybrid approach [Fields, S. and 0. 

Song., Nature, 340:245-6 (1989)] was used to screen for 
Myc- interacting proteins (Binl) in a murine embryonic 
cDNA library. The cDNA library was derived from day 10.5 
mouse embryonic RNA [A. B. Vojtek et al , Cell, 74: 

30 205-214 (1993)]. This system takes advantage of the 
modular nature of transcription factors, whose 
DNA-binding and transcriptional activating components can 
be assembled in trans by interacting protein (IP) domains 
derived from other polypeptides. A previously described 
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two hybrid system [Vojtek et al, cited above] and a 16 
amino acid nontransactivating polypeptide derived from 
the human c-Myc " Myc box 1" (MB1) region [Prendergast, 
G.C. and E.B. Ziff, Trends in Genet. , 8: 91-96.3 (1992)] 
5 EDIWKKFELLPTPFLS (human c-Myc amino acids 47-62) [SEQ ID 
NO: 5], were used as "bait" in the screen. 

Briefly, the "bait" plasmid contained a TRP1 
marker and a LexA-MBl fusion protein as the DNA binding 
component, and the cDNA library vector, pVP16, contained 

10 a LEU 2 marker and the herpes simplex virus VP16 protein 
as the transcriptional transactivator fused to the cDNA 
library inserts. cDNA synthesized from the 10.5 day 
murine embryo RNA was size-selected by random DNasel 
treatment to -0.05 kb, treated with Klenow enzyme, NotI 

15 linked, and subcloned into pVP16. This cDNA library was 
designed to express protein modules whose interactions 
might be occluded in full-length polypeptides. The yeast 
strain L40 (MAT a tr-pl-901 leu2-3,112 LYS2 : : (lexAop) 4 — 
HIS3 URA3: : (lexAop) 8-lacZ) served as the host for the two 

20 hybrid screen [see, Vojtek et al, cited above]. 

An L4 0 derivative expressing the MB1 "bait" was 
transfected with the cDNA library and approximately 3 x 
10 7 TRP+LEU+ transf ormants were examined in the primary 
screen, 300-4 00 of which were also the HIS+LacZ+ 

25 phenotype, which is diagnostic for interaction between 
the "bait" and library components [Vojtek et al , cited 
above] . The clones were cured of the original "bait" 
plasmid by standard methods [Guthrie, C. and G.R. Fink, 
eds., Guide to Yeast Genetics and Molecular Biology, 

30 Meth. Enzvmol . , 194 , Academic Press: New York (1991)]. 

One hundred clones cured of the bait plasmid were tested 
for interaction by a mating strategy with a set of test 
baits . 

The test "baits" included the original lexA-MBl 
3 5 peptide construct, a set of negative controls that 
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included no insert, lamin [A. B. Vojtek et al, Cell, 
74:205-214 (1993)3, the small GTP-binding protein RhoB 
[D. Jahner, Mm . cell. Biol. , 11:3682-3690 (1991)], the 
peptide FTRHPPVLTPPDQEVI [SEQ ID NO: 7] derived from rat 
5 protein kinase Q32 , a mutant MB1 peptide, a similarly 

sized but nonspecific peptide derived from protein kinase 
C, or lamin. The protein kinase C (PKC) peptide 
contained a phosphorylation site structurally analogous 
to the MB1 T58 phosphorylation site, which is recognized 
10 by glycogen synthase kinase-3 (GSK-3) , a kinase present 
in yeast. The PKC peptide was designed to control for 
binding proteins that might non-specif ically interact 
with phosphooligopeptides (e.g., peptidases, kinases, 
phosphatases). MB1 specificity was reproducibly 
15 exhibited by 14/99 of the original yeast clones. 

cDNA library plasmids were shuttled from the 
desired clones to E. coli [Guthrie et al , cited above] 
and the DNA sequence of the inserts was determined. All 
clones contained related or identical sequences of 
20 approximately 0.4 kb containing an open reading frame 
(ORF) of 135 amino acids encoding a Myc-interacting 
polypeptide, termed Binl [ SEQ ID NO : 2 ] , which exhibited 
specificity for Myc. 



25 



B. K^nterlal expressio n of murine Binl polypeptide 
r gren Tn no: 21 *s a solu ble GST fusion protein 
To study the association of the 135 aa murine 
Binl polypeptide [SEQ ID NO: 2] with Myc in vitro , the 
'0.4 kb cDNA [SEQ ID NO:l] was expressed as a 
glutathione-S-transferase (GST) fusion protein and used 
30 in binding assays with 35 S-methionine-labeled in vitro 

translated (IVT) proteins. The binding experiments were 
configured essentially as described in A. K. Rustgi et 
al, Nature, 352:541-544 (1991). 
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To construct the GST fusion protein, the murine 
cDNA insert on a Clal-EcoRI fragment was substituted for 
a similar fragment in pE47 [C. Murre et al, Cell , 56:777- 
783 (1989)], making pATG-99. The pATG-99 ORF included an 
5 initiator methionine, added a 15 amino acid N-terminal 

extension (3 amino acids from E47 and 12 amino acids from 
VP16) to the 135 residue clone #9 9 ORF, and retained the 
translational termination site derived from the two 
hybrid vector. Expression of the ATG99 polypeptide was 

10 confirmed by in vitro translation from pATG-99. The 
pATG-99 insert was then subcloned into pGEX-2T 
(Pharmacia) and the recombinant plasmid introduced into 
E. coli . GST-99 polypeptide was expressed and purified 
from E. coli cell extracts on glutathione-Sepharose 

15 (Pharmacia) , using protocols supplied by the vendor. 

Twenty (20) fil (~0.5 fig) of purified GST-99 
protein was analyzed on an SDS-PA gel fixed and stained 
with Coomassie Blue . The apparent molecular weight (MW) 
of the Binl component of the fusion (22 3cD) is larger 

2 0 than the predicted MW (14 kD) but is consistent with the 
apparent MW of in vitro translated murine Binl [SEQ ID 
NO: 2]. 

C. In Vitro Association of Mvc and Binl fSEO ID 
NO: 21 

25 [ 35 S] -methionine labeled c-Myc polypeptides 

were generated by IVT using TNT rabbit reticulocyte 
lysates (Promega) . Expression plasmids included CMV Hm 
[G. C. Prendergast et al , Cell , j65:395-407 (1991)]; CMV 
Hm subclones containing MB1 deletion amino acids 49-101 

30 [J. Stone et al f Mol . Cell. Biol « , 7:1697-1709 (1987)]; 
MB2 deletion amino acids 12 0-140 [ L. Li et al, EMBO J. , 
L3:4070-4079 (1994)], or both deletions; the adenovirus 
E1A vectors pl2S, pl3S; and the SV40 large T antigen 
vector pTag [unpublished data]; and CMV-USF [ L. Li, cited 

35 above] . 
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Approximately 2.5 fig of GST or GST-99 and 10 Ml 
of an IVT reaction were added to 0.5 ml binding buffer 
(10 mM TrisCl pH 7.5, 5 mM EDTA, 500 mM NaCl, 0.25% NP40) 
incubated 1 hr at 4°C on a nutator shaker, washed four 
5 times with binding buffer, and analyzed by SDS-PAGE and 
fluorography. c-Myc (but none of the other polypeptides 
produced by IVT) exhibited association with GST-99. 

D . Association of Binl r sro ID NO; 7.1 with TBP but 
not USF 

10 [ 35 s] -labeled TBP and USF were generated by IVT 

and tested for GST-99 binding as in C. above. 
Reinforcing the notion that it might be involved in MBl 
function in transcriptional regulation by Myc, Binl bound 
to TATA-binding protein [TBP, a critical component of the 

15 basal transcription apparatus] . Other polypeptides that 
were tested for GST-9 9 interaction and found to be 
negative included Max, cell cycle protein pl07 , 
transcription factor YY1, extracellular protein PAI-1, 
small GTP-binding protein RhoB, and empty-vector- 

20 associated products. Taken together, these findings 

argued that the association between GST-99 and Myc was 
both specific and physiologically relevant, since it 
depended upon the presence of the Myc boxes. 

Fvam ple 2 - Isolation of Human Binl cDNA 

25 BLAST searches of the complete DNA sequence database 

[GenBank] with the murine Binl sequence showed no strong 
similarities to known genes, but revealed an 
approximately 89% identity to an 289 bp uncharacterized 
human "expressed sequence tag". This finding suggested 

30 that Binl represented a novel gene conserved and 
expressed in humans. 

Northern analysis of RNA from several human tissues 
using a murine Binl cDNA [SEQ ID NO: 1] as probe revealed 
a single RNA species of "2.2 kb that was abundant in 
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skeletal tissue. A 1.95 kilobase human Binl cDNA was 
obtained from a human skeletal muscle AZAPII cDNA library 
(Stratagene, La Jolla, CA) by standard methods [Sambrook 
et al 7 cited above], using the murine Binl probe, i.e., 
5 by hybridization with [ 32 P] -labeled clone #99 insert and 
washing under low stringency conditions (2 x SSC 42 °C). 
The complete sequence of this "2.0 kb full-length cDNA, 
p99f , was determined [SEQ ID NO: 3] using the dideoxy 
method with Sequenase (US Biochemicals) and assembled and 

10 analyzed with MacVector software (IBI/Kodak) . DNA 

database comparisons were performed using BLAST software. 
The subcloned cDNA contained a 451 amino acid ORF with 
approximately 88% identity to a C-terminal region of 
murine Binl. The human ORF was therefore designated 

15 human Binl [SEQ ID NO: 4]. 

Example 3 - Human Binl Gene Isolation, Structure and 
Regulation 

A. DNA sequencing of the human Binl gene 
Genomic clones of human Binl have been 

2 0 obtained. A 40 Kb contiguous sequence composed of five 

lambda phage genomic inserts has been assembled which 
contains the entire Binl gene. Approximately 15kb of the 
gene sequence is provided in SEQ ID NO: 6. Eight 
contiguous exons from the C-terminal BAR region to the 
25 SH3 domain have been identified. Three additional N- 

terminal BAR exons have also been identified. Five other 
exons identified by DNA sequence analysis algorithms 
appear in alternatively spliced RNAs found to be 
expressed exclusively in brain. With reference to the 

3 0 features information provided with respect to SEQ ID NO: 

6, the nine exon sequences correspond to the following 
Binl cDNA sequences [SEQ ID NO: 3]: 623-655 (partial 
sequence of BAR region exon); 656-731 (3 1 BAR region 
exon) ; 732-814 (Ul region exon) ; 815-859 (NLS) ; 860-1004 
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(U2 region exon) ; 1005-1094 (5« MBD region exon) ; 1095- 
1205 (3' MBD region exon); 1206-1307 (5« SH3 domain 
region exon); 1308-1925 (3- SH3 domain/3' untranslated 

region [UTR] exon) . 

Using the genomic clones, the human Binl gene 
has been mapped to chromosome 2ql4. This region is 
within a mid-2q locus that has been reported to be 
deleted in approximately 50% of metastatic prostate 
carcinomas. The region of the murine genome syntenic to 
human 2ql4 has also been reported to be deleted in >9 0% 
of radiation induced leukemias and lymphomas. These data 
strengthen the previous assertion that Binl may be 
encoded by a novel tumor suppressor gene. 

B . increase in Binl Levels Purina Muscle and 
15 Neuronal Differ entiation 

Binl RNA has been found to be present in brain 
and muscle cells at 10- to 100-fold higher levels than 
other tissues, a feature shared with cell cycle kinase 
inhibitors (CKIs) . Since these cells are postmitotic and 
2 0 Binl had been shown to block Myc's ability to induce cell 
cycle progression, it is possible that upregulation of 
Binl has a role in cell cycle exit associated with cell 
differentiation. To begin to assess this possibility, 
Binl expression was examined using in vitro model systems 
2 5 for differentiation of muscle cells (murine C2C12 

premyoblast cells) [L. Silberstein et al, Cell, 46:1075- 
1081 (1986)3 and neurons (rat PC12 pheochromocytoma 
cells) [L. A. Greene and A.S. Tischler, Proc. Natl. Acad. 
Sci. USA . 73:2424-2428 (1976)]. , 
30 T his analysis revealed that both Binl RNA and 

protein are regulated during cell differentiation. Binl 
RNA levels were increased following induction of cell 
differentiation in C2C12 or PC12 cells, by serum deprival 
or nerve growth factor (NGF) addition, respectively. In 
3 5 untreated PC12 cells, three transcripts of "1.3, "2.4, 
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and "2.9 kb were noted. Within 5 days of NGF treatment 
the level of the "2.9 kb RNA was increased several-fold, 
concomitant with neurite extension, while the level of 
the other two RNAs decreased to undetectable levels. The 
5 nature of the "1.2 kb transcript, which was most abundant 
in untreated cells, was unclear but its unusually small 
size suggested the possibility that it was truncated due 
to mutation (PC12 was derived from a rat adrenal gland 
tumor). In C2C12 cells, a single "2.4 kb transcript 

10 noted increased "2 0-fold within 5 days of serum deprival, 
concomitant with myotube formation. These observations 
suggest that Binl may be involved in cell cycle 
regulation during neuronal and muscle cell 
differentiation. 

15 C. Identification of a larger Binl Polypeptide in 

Differentiated Muscle Cells 

Western analysis with the 99D antibody 
confirmed an increase in Binl expression and revealed the 
presence of a slightly larger Binl polypeptide generated 

20 3 days post-induction. Levels of the smaller Binl 

polypeptide detected in undifferentiated cells was found 
to remain constant while the larger species increased 
dramatically. Indirect immunofluorescence using 99D 
antibody was used to examine the cell localization of 

25 Binl during C2C12 differentiation. Binl staining was 

found to change from a strictly nuclear pattern to whole 
cell pattern including the cytoplasm. The 99F antibody 
was found to detect only the larger polypeptide and stain 
only the cytoplasm (did not stain the nuclear protein) . 

3 0 Thus, the larger Binl polypeptide induced during 

differentiation is completely confined to the cytoplasm. 
A dominant negative genetic approach is being taken to 
determine whether induction of the large Binl species is 
necessary for cell cycle exit during differentiation. 
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g^m pl* 4 - construction o f Mammalian Expression Vectors 
a nH Timnunoprec -ipitation Techniques 

Viral vectors for delivering Binl into insect, 
rodent and human cells have been developed for various 
purposes, including therapeutic purposes and to permit 
high-level Binl protein production and efficient gene 

transfer. 

A. Baculoviral Vector 

-1.6 kb EcoRI fragment containing the complete 
Binl coding region was inserted into the baculovirus 
recombination vector pVL1393 (Invitrogen, Inc., San 
Diego, CA), generating pBacBin. Sf9 insect cells were 
cotransfected with pBacBin and a plasmid encoding a 
defective baculovirus which cannot propagate. Rare 
recombination between these two plasmids in vivo leads to 
generation of a lytic recombinant baculovirus which can 
be propagated. Virus produced in cultures of 
cotransfected cells was propagated in mass Sf9 culture. 
Binl production was verified by Western analysis of NP40 
ly sates prepared 2 4 and 4 8 hr after infection of Sf9 
cells infected with the BacBin virus, using 99D 
monoclonal antibody. 

B. Adenovi ra 1 vector 

The strategy and plasmid vector systems to 
25 produce recombinant adenovirus has been described [K. 
Kozarsky et al , Onin. Genet. Dev., 3:499-503 

(1993)]. Similar to the approach taken to make 
baculoviral vectors, two plasmids are used which contain 
complementary regions which can homologously recombine in 
30 vivo. Recombinant virus is produced only in transfected 
cells where recombination has taken place. The plasmid 
P AdCMV P AT153 is used to introduce the gene of interest. 
P AdCMV P AT153 contains the left 6% of the adenovirus 
serotype 5 genome, modified such that the El region is 
3 5 replaced with a cytomegalovirus (CMV) early region 
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enhancer/promoter, multiple cloning site, and a G418 
resistance gene cassette. Included in the cell 
transfection with this vector is a -34 kb Clal-digested 
fragment of adenovirus type 5 DNA that includes the 
5 remainder of the adenoviral genome. This fragment 

contains a mutation in the E3 region which ablates the 
immune response in adenovirus-inf ected animals [T. 
Ranheim et al, J . Virol . . 67: 2159-2167 (1993)]. This 
feature was incorporated into the recombinant virus to 
10 increase the persistence and therefore the potential 

efficacy of Binl-based gene therapy approaches. The cell 
host for transfection is human 293 cells, an epithelial 
line which expresses the El region gene products required 
for propagation of recombinant adenoviruses. 
15 The plasmid pAdenoBin was generated by 

inserting a -1.6 kb EcoRI fragment containing the 
complete Binl coding region into the multiple cloning 
site of P AdCMVpAT153. 293 cells cotransf ected with 
pAdenoBin and the Clal-digested adenoviral DNA fragment 
were subjected to G418 selection and screening and 
purification by plaque assay (recombinant viruses are 
lytic in 293 cells). DNA isolated from a Binl virus 
identified in this manner will be validated by Southern 
analysis to confirm that the Binl cDNA is intact. 

These vectors are particularly well suited for 
use in human therapies. 

c - Moloney retroviral vector 

A recombinant Binl retrovirus was generated 
using methods that have been described [N. Landau et al, 
30 J. Virol. , 66:5110-5113 (1992)]. The Binl plasmid vector 
pSRaMSV-Binl was generated by inserting the -1.6 kb EcoRI 
fragment containing the complete Binl coding region into 
pSRaMSV, a retroviral vector that lacks RNA packaging 
signals and includes a G418 resistance gene cassette. 
3 5 Briefly, recombinant virus was isolated from the media of 



20 



25 



WO 96/34627 



PCT/OS96/06231 



30 



10 



COS monkey cells cotransf ected with pSRaMSV-Binl and 
pSVf-E-MLV, a proviral vector which provides the 
necessary retroviral packaging components. Recombinant 
virus were used to infect Rati fibroblasts and infected 
cell populations were selected by G418 selection. 
Expression of recombinant Binl in the Rati cell 
populations was confirmed by Northern and Western 
analysis. 

Although the procedure above generated 
ecotropic Binl retroviruses limited to gene transfer to 
murine cells, those with skill in the art can easily 
generate amphotropic retroviruses that can transfer Binl 
to human, cells. This is achieved by simply 
cotransfecting COS cells with pSRaMSV-Binl and pSVT-A- 
15 MLV, a packaging vector which encodes an amphotropic 

instead of ecotropic envelope glycoprotein [N. Landau et 
al, cited above]. Such vectors have been applied for use 
in gene therapies to attack human cancers. 
D. Mammalian ExnreP ^nn Vectors 
20 Binl mammalian cell expression vectors were 

constructed as follows and were used to generate the Binl 
proteins used in the following experiments. CMV-Binl was 
generated by subcloning a 1.6 kb EcoRI fragment from the 
full-length human Binl cDNA clone, P 99f, that contained 
25 the entire predicted Binl coding sequence into pcDNA3 
(Invitrogen) , a mammalian cell expression vector that 
contains a cytomegalovirus enhancer/promoter and a 3 ' 
polyadenylation signal. CMV-HA-Binl was constructed by 
substituting a PvuII-EcoRI coding region fragment from 
30 CMV-Binl for an EcoRV-EcoRI fragment of neoCMV-hem rhoA, 
a RhoA expression plasmid that included an 8 residue N- 
terminal viral hemagglutinin (HA) epitope recognized by 
the monoclonal antibody 12CA5 [H. Niman et al, Proc_ 
M»f.1. Acad. Sci. USA , 80:4949-4953 (1983)]. TheHA-Binl 
3 5 polypeptide created included residues 1-47 from the N- 
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terminus of RhoA [Yeraiaian et al, Nucl. Acids Res> , 
15.: 1869 (1987)] and residues 52-451 of Binl [SEQ ID NO: 
4]. This protein fusion added an N-terminal extension to 
Binl that allowed immunoprecipitation by anti-HA antibody 
5 12CA5 [H. Niman et al , cited above], CMV-BinlAMBD 

deleted amino acid residues 270-377 [of SEQ ID NO: 4] in 
CMV-Binl. It was constructed by ligating two separate 
PCR fragments generated by the 5 1 primer 

CCGGATCCGCGATGCTCTGGAACGTGGTGACG [nucleotides 60-80 of 

10 SEQ ID NO: 3] and the 3* primer GCGAATTCGTTGTCACTGTTC 
TTCTTTCTGCG (fragment encoding aa 1-269) [nucleotides 
866-842, corresponding to the antisense strand of SEQ ID 
NO: 3] and the 5' primer CGGAATTCACCATGGGTTTCATGTTC 
AAGGTACAG [nucleotides 1191-1211 of SEQ ID NO; 3] and the 

15 3 1 primer CCGCTCGAGTCATGGGACCCTCTCAGTGAAGTT (fragment 
encoding aa 378-451) [nucleotides 1415-1392, 
corresponding to the antisense strand of SEQ ID NO: 3]. 
This construction added the nonspecific amino acids EFTM 
at the fusion junction due to the restriction site added. 

20 E . Immunoprecipitation 

COS, MCF7 , and IMR9 0 cells were cultured in 
Dulbecco's modified Eagle's media (DMEM) supplemented 
with 10% fetal bovine serum (Sigma) and 50 U/ml each 
penicillin and streptomycin (Fisher) . Cells were 

25 transfected by a modified calcium phosphate protocol [C. 
Chen et al, Mol. Cell . Biol. . 7:2745-2752 (1987)] and 
metabolically labeled 4 8 hr later. Rabbit antisera was 
raised to a GST fusion protein including amino acid 
residues 189-398 of Binl (GST-99Pst) [SEQ ID NO: 4], that 

30 included all of the MBD, using a commercial service 

(Rockland, Inc., Boyerstown, PA). Ten microliters of 
crude antisera or prebleed sera was used for 
immunoprecipitations from IMR90 or COS cells 
metabolically labeled 2-4 hr in DMEM media lacking 

35 methionine and cysteine (Gibco) with 75-125 /xCi/ml 
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EXPRESS labeling reagent (NEN) , washed with ice-cold 
phosphate-buffered saline, and extracted for 20 min on 
ice with RIPA buffer containing the protease inhibitors 
leupeptin, aprotinin, phenylmethylsulf onyl fluoride, and 
5 antipain [E. Harlow et al, Antibodies: A Laboratory 

Manual, Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, NY (1988)]. Cell lysates were precleared by 
centrifugation at 20,000 g for 15 min at 4'C followed by 
1 hr treatment with prebleed sera and 2 0 n± of a 1:1 

10 slurry of protein G sepharose beads at 4°C on a nutator 
(Pharmacia) . Precleared lysates were immunoprecipitated 
90 min at 4'C and then additional protein G beads were 
added and the incubation an additional 30 min- Beads 
were collected by brief centrifugation, washed four times 

15 with RIPA buffer, boiled in SDS gel loading buffer, 
fractionated on 10% gels, and f luorographed. 

To establish that the Binl cDNA encoded a 
polypeptide similar to that found in normal cells, 
metabolically labeled extracts from IMR90 normal human 

2 0 diploid fibroblasts were subjected to 

immunoprecipitation. The results are described in 
Example 5 below. 

Example 5 - Characteri zation of Binl 

A polyclonal antiserum was raised to a bacterially- 

25 expressed polypeptide derived from the unique central 
region of Binl, in order to reduce the chance of 
crossreaction with Binl-related proteins. When incubated 
with metabolically labeled extracts from COS cells 
transfected with CMV-Binl, this antisera 

30 immunoprecipitated two polypeptides with apparent MW 70 
kD and 45 3cD. Each polypeptide was specifically 
recognized because their immunoprecipitation could be 
blocked by preincubating antisera with a molar excess of 
GST-Binl immunogen but not with unfused GST. In COS 
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cells transfected with CMV-HA-Binl , only the 70 kD 
polypeptide was immunoprecipitated by an anti-HA 
monoclonal antibody. The IVT product from the full- 
length cDNA also had an apparent mobility of 70 kD. 
5 These data indicated that the 7 0 kD species was Binl and 
suggested that the 45 kD species was a Binl-related 
polypeptide. Cells transfected with CMV-BinlAMBD, a Binl 
deletion construct lacking the central Myc-binding domain 
(amino acid 270-377 of SEQ ID NO: 4), exhibited stable 

10 accumulation of a polypeptide whose predicted and 

apparent MW were both 42 kD. This result indicated that 
full-length Binl migrated aberrantly due to an MBD 
determinant at 70 kD in SDS polyacrylamide gels, instead 
of at the predicted MW of 50 kD. Only the 45 kD 

15 polypeptide was detected in untransf ected MCF7 breast 
tumor cells, which lacked Binl RNA, or in cells 
transfected with empty vector. Thus, the 4 5 kD species 
was not a coprecipitant or an alternately processed or 
degraded form of Binl. Consistent with its assignment as 

20 a Binl-related protein, the 45 kD polypeptide could be 
detected by Western blotting. 

Example 6 - Immunofluorescence Studies 

-5 x 10 3 HepG2 cells were seeded onto glass cover 
slips in 6 cm dishes and the next day transfected 

2 5 overnight with 4 fig CMV-Binl or pcDNA3 . Two days later 
cells were washed and processed for immunofluorescence 
essentially as described [G. Prendergast et al, EMBO J. , 
iO: 757-766 (1991)], using 5 \xg of protein A Sepharose- 
purified anti-Binl IgG and a 1:1000 dilution of 

30 f luorescein-conjugated anti-rabbit IgG (Cappel) as the 
secondary antibody. Stained cover slips were examined 
and analyzed on a Leitz conf ocal microscope . 

In this manner, cell localization was examined by 
indirect cell immunofluorescence of transiently 
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transfected cultures of HepG2 hepatocarcinoma cells, 
which like MCF7 cells lack detectable Binl RNA (see 
below) and therefore provided an internal control for any 
crossreacting polypeptides. HepG2 cells transfected with 
5 CMV-Binl but not vector exhibited a speckled nuclear 
pattern of staining. The nuclear localization was 
consistent with the presence of a NLS in the primary 
sequence of Binl and with a nuclear site of interaction 
with Myc. 

10 BvnTiinle 7 - TnMbifc j nrt of Mvc Oncogenic Activity by Binl 
Since Binl was identified on the basis of its 
interaction with MB1, which is implicated in Myc 
transformation activity [J. Stone et al, Mol . Cell. 
Biol. , 1:1697-1709 (1987) and B. Pulverer et al, 
15 oncogene , 9:59-70 (1994)], the effects of Binl and the 
MBD deletion mutant BinlAMBD (Example 4) were tested on 
cell transformation by Myc, adenovirus E1A, and SV4 0 T 
antigen in the Ras cooperation assay [H. Land et al, 
Nature, 304:596-602 (1983)] performed in primary rat 
2 0 embryo fibroblasts (REFs) . Since the original clone #99 
cDNA was partial and encoded essentially only the MBD, it 
was anticipated that the clone #99 ORF might act in a 
dominant negative manner to interfere with either 
endogenous Binl. Therefore, the effects of a clone #99 
25 expression vector (Example 4) on Myc transformation were 
also tested. 

The -0.5 kb murine cDNA [SEQ ID NO: 1] engineered 
with a 5' Kozak initiator methionine from pATG-99 was 
subcloned into pcDNA3 (a CMV enhancer/promoter vaccine; 
30 invitrogen, San Diego, CA) to generate neoCMV-ATG9 9 . REF 
culture and transf ectio.n was performed essentially as 
described [G. Prendergast et al , Genes Dev. , 6:2429-2439 
(1992)]. Briefly, secondary passage REFs seeded into 10 
dishes were transfected overnight by a calcium 
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phosphate coprecipitation method [C. Chen et al, cited 
above] with 5 Atg each of the oncogene plasmids and 10 fig 
of other plasmids indicated, then passaged 1:5 the next 
day and fed with normal growth media until foci were 
5 scored by methanol fixation and crystal violet staining 
12-14 days later. In some experiments, 0.5 mg/ml G418 
was added the day after passaging. The following 
oncogene plasmids were used in REF assays. LTR Hm, which 
contains a Moloney long terminal repeat-driven normal 

10 human c- mvc gene, and pT22, which contains an activated 
H-ras gene, have been described [H. Land et al, cited 
above and A. Kelekar et al , Mol . Cell. Biol. , .6:7-14 
(1986)]. A nontrans forming Myc frameshift mutant (MycFS) 
was constructed by digestion of LTR Hm with a unique Bst 

15 Ell in exon 2 of the human c- mvc gene, filling with 

Klenow enzyme, and self ligation. The MycFS polypeptide 
encoded by this mutant, LTR Hm/Bst, is frameshifted at 
amino acid residue 104, eliminating its biological 
function. This frameshift mutant was included to 

2 0 establish that the augmentation of foci formation by CMV— 
ATG99 was Myc-dependent . In some control experiments, 
NeoCMV T and plA/neo, encoding SV4 0 T antigen and 
adenovirus E1A, respectively, were substituted for LTR 
Hm. Transformed foci were scored two weeks later. 

2 5 The results of the REF focus formation experiments 

are shown in Fig. 3. On its own or with activated ras , 
Binl lacked transforming activity- However, when 
cotransf ected with myc and ras , Binl selectively 
inhibited focus formation ~7-fold. Inhibition could be 

3 0 titered by decreasing the ratio of Binl to myc and ras 

Vectors in the transfected DNA (data not shown) . In 
contrast to the effect of full-length Binl, but 
consistent with a dominant inhibitory effect, the murine 
vector neoCMV-ATG99 specifically augmented focus 
35 formation -2- to 4-fold when cotransf ected with myc and 
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ras. Binl also inhibited ElA-dependent transformation, 
consistent with the fact that E1A and Myc function 
similarly in biological assays [G. Evan et al, Cell , 
6£:119-128 (1992); H. Land et al, cited above; H. Ruley, 
5 Nature . 304 :602-606 (1983); and L. Rao et al, Proc. Natl. 
&r.ad. Sci. USA . 89:7742-7746 (1992)]. However, Binl did 
not affect T antigen-dependent transformation. This 
result indicated that the inhibition of Myc and E1A was 
not due to toxicity or nonspecific inhibition of the 
10 transformed phenotype. Notably, BinlAMBD significantly ^ 
inhibited E1A but not Myc. The lack of an effect of 
BinlAMBD on Myc transformation could not be explained by 
protein instability, because BinlAMBD had been shown to 
stably accumulate in transfected COS cells and could 
15 inhibit E1A transformation. Although the means by which 
Binl and BinlAMBD inhibited E1A was unclear, an important 
implication of this result was that Binl inhibited E1A 
and Myc by different mechanisms, an interpretation 
consistent with the differential binding of these 
20 oncoproteins to the MBD represented in GST-99 . 

Supporting the notion that Binl was incompatible with Myc 
or E1A transformation, exogenous Binl message accumulated 
in REF cell populations derived from transformation with 
T antigen but not with Myc or E1A; in contrast, BinlAMBD 
25 message accumulated in REFs transformed by either Myc or 
E1A. There is a possibility that a reduced activity of 
BinlAMBD revealed intrinsic differences in the 
sensitivity of E1A and Myc to Binl inhibition. However, 
with this caveat, it was concluded that Binl 
30 physiologically interacted with and inhibited Myc, since 
deletion of a Binl domain sufficient for association in 
vitro was necessary for its inhibition activity in vivo . 
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Example 8 - Northern Analysis 

Northern analysis was performed to examine Binl 
expression patterns. Total cytoplasmic RNA from tumor 
cell lines was isolated [G. Prendergast et al , MoL Cell. 
5 Biol. , 9:124-134 (1989)] and hybridized to [ 32 P] -labeled 
Binl cDNA probes [G. Church et al , Proc. Natl. Acad. Sci. 
USA , 81:1991-1995 (1984)]. 

Ubiquitous expression in normal murine and human 
cells was observed. In the mouse, RNA levels were 

10 highest in embryo, adult brain, and adult muscle but 

lower levels were seen in all other tissues examined. In 
embryo and brain, at least two transcripts could be 
resolved, suggesting alternate splicing or differential 
usage of initiation or polyadenylation sites in some 

15 cells. In human cells, RNA levels were similar in WI-38 
normal diploid fibroblasts and tumor cells derived from 
several different tissues. However, Binl message levels 
were undetectable in HepG2 hepatocarcinoma and MCF7 
breast carcinoma cells and were >10-fold reduced in SK- 

20 CO-l colon carcinoma cells. Further examination revealed 
similar deficits in 5/6 breast and 3/6 cervix carcinomas, 
and in 4/7 liver and 1/2 lung carcinomas. In total, loss 
of Binl message was observed in 14/27 carcinoma cell 
lines examined. 

25 Example 9 - Colony Formation Assays 

The functional significance of deficits in Binl 
. message levels in certain tumor cells (as in Example 8) 
was suggested by G418-resistant cell colony formation 
experiments performed in four cell lines available from 
30 the American Type Culture Collection (Rockville, 

Maryland) that either contained (HeLa) or lacked (HepG2, 
MCF7, SAOS-2) endogenous Binl RNA. 

Colony formation assays were performed in the 
following manner. 3 x 10 5 cells in 3 cm dishes were 
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transfected overnight with 2 ptg CMV-Binl (described in 
Example 4) or an empty vector, using Lipof ectamine 
(Gibco/BRL) * Cells were passaged 4 8 hr after 
transfection at a 1:10 ratio into 6 cm dishes containing 
5 media with -0.6 mg/ml G418, which permits selection for 
the neomycin gene present on each plasmid. Drug- 
resistant cell colonies were scored by crystal violet 
staining 2-3 weeks later- At least three trials for each 
cell line were performed and colonies were scored in 
10 triplicate dishes. 

HepG2, MCF7, and SAOS-2 cells transfected with a 
Binl vector exhibited approximately 3-fold fewer colonies 
relative to cells transfected with empty vector, whereas 
no significant difference in HeLa colony formation was 
15 seen (Fig. 4) . Cell populations derived from pooled 
colonies which emerged from Binl-transf ected HepG2 
cultures showed no evidence of expression, when examined 
by immunoprecipitation, consistent with an 
incompatibility with cell growth. From this data, it can 
20 be concluded that the RNA deficits seen in carcinoma 
cells are functionally significant and that Binl can 
inhibit tumor cell growth, consistent with a tumor 
suppressor function . 

Kvam ple 10 - Rearrangement and lo ss of expression of the 
25 Binl aene in 1 iver and breast cancer c ells 

Because Binl had been demonstrated to inhibit 
Myc-dependent cell transformation and tumor cell growth, 
the following study was performed to determine if the 
Binl gene is mutated in human tumor cells. The initial 
30 experiment was to perform Southern analysis of the 
genomic DNA from a panel of human tumor cell lines 
including HeLa [cervix, ATCC CCL 2], SK-CO-1 [colon, ATCC 
HTB 39], HT-29 [colon, ATCC HTB 38], DU145 [prostate, 
ATCC HTB 41], PC-3 [prostate, ATCC CRL 1435], LNCaP 
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[prostate, ATCC CRL 1740]; T24 [bladder, ATCC HTB4 ] ; MCF7 
[breast, ATCC HTB 22]; HepG2 [liver, ATCC HB 8065]; Rh-30 
[myosarcoma, E.C. Douglass et al, "A specific chromosomal 
abnormality in rhabdosarcoma, Cvtoaenet. Cell Genet. , 
5 45:148-155 (1987)]; Ra j i [lymphoma, ATCC CCL 86]. DNA 
from WI-38 normal diploid fibroblasts [ATCC CCL 75] was 
used as a source of normal DNA. 

DNAs were isolated by standard methods (Sambrook et 
al, cited above) and 5 fig per sample was treated with 

10 Hindlll restriction endonuclease . Restricted DNA was 

fractionated on a 0.65% agarose gel which was denatured 2 
x 15 minutes in 1.5 M NaCl/0.5M NaOH, neutralized 2 x 30 
minutes in 1.5 M NaCl/0.5 TrisCl pH 8, and then blotted 
to a charged nylon membrane (Stratagene, La Jolla CA) . 

15 The blot was crosslinked by UV irradiation and hybridized 
in a commercial hybridization solution with a 
random-primed 32 P-labeled Binl cDNA probe according to 
the vendor's instructions (Amersham, Cambridge UK). The 
blot was washed 1 x 10 minutes with 2X SSC/0.1% SDS at 

20 20 °C and then 2 x 10 minutes with 0.2 SSC/0.1% SDS at 
65 °C before being exposed to X-ray film (DuPont, 
Wilmington DE) . 

Two bands of >20 kb and 6.5 kb were observed in all 
the genomic DNAs except for HepG2 , a liver carcinoma cell 

25 line, where an additional band of "3.5 kb was seen. 

Following this observation, a second Southern analysis 
was performed on a panel of 9 liver carcinoma cell lines, 
including Huhl, Huh2 , HepG2 [ATCC HB8065], Hep3B [ATCC 
HB8064], Hep43, Hep63 , HLF [ATCC CCL 199], NCH2 , and 

30 NHep40 (provided by Dr. D. Simon, Medical College of 

Pennsylvania) . Conditions were the same as above except 
that Pstl restriction endonuclease was used. 

Five bands of 2.5, 1.8, 1.5, 0.95, and 0.75 kb were 
observed in WI-3 8 normal DNA. Four of the nine liver 

35 tumor DNAs (HepG2 , Hep3B, NCH2 , and NHep40) exhibited an 
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30 



additional band of 2.9-3,3 kb. These data corroborated 
the previous results and indicated that Binl may be 
mutated during the development of human hepatocarcinoma. 
Northern analysis on RNA isolated from the initial 
5 panel of human tumor cells, including HepG2 liver 
carcinoma cells, was performed to examine Binl 
expression. A similar analysis of RNAs isolated from 
mouse embryo or adult tissues was also performed. Total 
cytoplasmic RNA was purified by standard methods 
10 (Sambrook et al, cited above) and 15 fig was fractionated 
on a 1% formaldehyde agarose gel and blotted as described 
[G.C. Prendergast and M.D. Cole, Mol . Cell. Biol., 9: 
124-134 (1989)]. A commercial Northern blot containing 
RNA from normal human brain, heart, kidney, lung, liver, 
15 skeletal muscle, pancreas, and placenta (Clontech, Palo 

Alto CA) was also analyzed. Using the same procedure and 
conditions as above, the Northern blots were hybridized 
with Binl cDNA probe, washed, and exposed to X-ray film. 
A Binl-specif ic "2.2 kb RNA was observed in all 
20 tissues and cell lines except HepG2 and the breast 

carcinoma cell line MCF7 . This result indicated that 
Binl was ubiquitously expressed and that mutation in 
HepG2 cells was correlated with loss of expression. This 
result directly supports the utility of RT-PCR and FISH 
25 for diagnosing Binl loss in liver and breast cancer 
biopsies. 

All documents cited above are incorporated herein by 
reference. Numerous modifications and variations of the 
present invention are included in the above- identified 
specification and are expected to be obvious to one of 
skill in the art. Such modifications and alterations to 
the compositions and processes of the present invention 
are believed to be encompassed in the scope of the claims 
appended hereto. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Wistar Institute of Anatomy & Biology 

(ii) TITLE OF INVENTION: Murine and Human Box-Dependent 
Myc-Interacting Protein (BIN1) and Uses Therefor 

(iii) NUMBER OF SEQUENCES: 7 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Howson and Howson 

(B) STREET: Spring House Corporate Cntr, P O Box 4 

(C) CITY: Spring House 

(D) STATE: Pennsylvania 

(E) COUNTRY: USA 

(F) ZIP: 19477 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patent In Release #1,0, Version #1-30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: WO 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/43 5,454 

(B) FILING DATE: 05-MAY-1995 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Bale, Mary E. 

(B) REGISTRATION NUMBER: 31,215 

(C) REFERENCE /DOCKET NUMBER: WST60APCT 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 215-540-92 00 

(B) TELEFAX: 215-540-5818 



(2) INFORMATION FOR SEQ ID NO:l: 

( i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4 02 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY : unknown 
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ATG GGT GTG AAG GAG AGC GAC TGA 
Met Gly Val Lys Glu Ser Asp 
130 



42 



84 



(ii) MOLECULE TYPE: cDNA 

( ix) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 1-.399 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

GAG ATC AGA GTG AAC CAT GAG CCA GAG CCG GCC AGT GGG GCC 
Glu lie Arg Val Asn His Glu Pro Glu Pro Ala Ser Gly Ala 
! 5 10 

TCA CCC GGG GCT GCC ATC CCC AAG TCC CCA TCT CAG CCA GCA 
Ser Pro Gly Ala Ala lie Pro Lys Ser Pro Ser Gin Pro Ala 
15 20 25 

GAG GCC TCC GAG GTG GTG GGT GGA GCC CAG GAG CCA GGG GAG 12 6 

Glu Ala Ser Glu Val Val Gly Gly Ala Gin Glu Pro Gly Glu 
30 35 . 40 

ACA GCA GCC AGT GAA GCA ACC TCC AGC TCT CTT CCG GCT GTG 168 
Thr Ala Ala Ser Glu Ala Thr Ser Ser Ser Leu Pro Ala Val 
45 50 55 

GTG GTG GAG ACC TTC TCC GCA ACT GTG AAT GGG GCG GTG GAG 210 
Val Val Glu Thr Phe Ser Ala Thr Val Asn Gly Ala Val Glu 
60 65 70 

GGC AGC GCT GGG ACT GGA CGC TTG GAC CTG CCC CCG GGA TTC 252 
Gly Ser Ala Gly Thr Gly Arg Leu Asp Leu Pro Pro Gly Phe 

75 80 

ATG TTC AAG GTT CAA GCC CAG CAT GAT TAC ACG GCC ACT GAC 29 4 
Met Phe Lys Val Gin Ala Gin His Asp Tyr Thr Ala Thr Asp 
85 90 95 

ACT GAT GAG CTG CAA CTC AAA GCT GGC GAT GTG GTG TTG GTG 
Thr Asp Glu Leu Gin Leu Lys Ala Gly Asp Val Val Leu Val 
100 105 HO 

ATT CCT TTC CAG AAC CCA GAG GAG CAG GAT GAA GGC TGG CTC 
He Pro Phe Gin Asn Pro Glu Glu Gin Asp Glu Gly Trp Leu 
115 120 125 



336 



378 



402 
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(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 13 3 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Glu lie Arg Val Asn His Glu Pro Glu Pro Ala Ser Gly Ala Ser 
15 10 15 

Pro Gly Ala Ala lie Pro Lys Ser Pro Ser Gin Pro Ala Glu Ala 

20 25 30 

Ser Glu Val Val Gly Gly Ala Gin Glu Pro Gly Glu Thr Ala Ala 

35 40 45 

Ser Glu Ala Thr Ser Ser Ser Leu Pro Ala Val Val Val Glu Thr 

50 55 60 

Phe Ser Ala Thr Val Asn Gly Ala Val Glu Gly Ser Ala Gly Thr 

65 70 75 

Gly Arg Leu Asp Leu Pro Pro Gly Phe Met Phe Lys Val Gin Ala 

80 85 90 

Gin His Asp Tyr Thr Ala Thr Asp Thr Asp Glu Leu Gin Leu Lys 

55 100 105 

Ala Gly Asp Val Val Leu Val He Pro Phe Gin Asn Pro Glu Glu 

110 115 120 

Gin Asp Glu Gly Trp Leu Met Gly Val Lys Glu Ser Asp 

125 130 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1925 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 
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(ix) FEATURE: 

(A) NAME/ KEY : CDS 

(B) LOCATION: 60.. 1412 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GAATTCCGTG CTGGTTGAGC TTG CTCATCT CCTTGTGGAA GTTTTCCTCC 

AGGCCCGCG ATG CTC TGG AAC GTG GTG ACG GCG GGA AAG ATC 
Met Leu Trp Asn Val Val Thr Ala Gly Lys lie 
! 5 10 

GCC AGC AAC GTG CAG AAG AAG CTC ACC CGC GCG CAG GAG AAG 
Ala Ser Asn Val Gin Lys Lys Leu Thr Arg Ala Gin Glu Lys 



GTT CTC CAG AAG CTG GGG AAG GCA GAT GAG ACC AAG GAT GAG 
Val Leu Gin Lys Leu Gly Lys Ala Asp Glu Thr Lys Asp Glu 



40 45 50 

GAG GGC ACC CGG CTG CAG AAG GAT CTC CGG ACC TAC CTG GCC 
Glu Gly Thr Arg Leu Gin Lys Asp Leu Arg Thr Tyr Leu Ala 
55 60 65 

TCC GTC AAA GCC ATG CAC GAG GCT TCC AAG AAG CTG AAT GAG 
Ser Val Lys Ala Met His Glu Ala Ser Lys Lys Leu Asn Glu 
70 75 80 

TGT CTG CAG GAG GTG TAT GAG CCC GAT TGG CCC GGC AGG GAT 
III Leu Gin Glu Val Tyr Glu Pro Asp Trp Pro Gly Arg Asp 
1 85 90 95 

CAG GCA AAC AAG ATC GCA GAG AAC AAC GAC CTG CTG TGG ATG 
SS aS A^n Lys lie Ala Glu Asn Asn Asp Leu Leu Trp Met 

100 105 

GAT TAC CAC CAG AAG CTG GTG GAC CAG GCG CTG CTG ACC ATG 
Asp ?yr nil Gin Lys Leu Val Asp Gin Ala Leu Leu Thr Met 
110 H5 120 

GAC ACG TAC CTG GGC CAG TTC CCC GAC ATC AAG TCA CGC ATT 
Asp Thr Tyr Leu Gly Gin Phe Pro Asp He Lys Ser Arg He 
125 130 _ I 35 

GCC AAG CGG GGG CGC AAG CTG GTG GAC TAC GAC AGT GCC CGG 
Ala Lys Arg Gly Arg Lys Leu Val Asp Tyr Asp Ser Ala Arg 
140 145 



50 
92 

134 

176 



CAG TTT GAG CAG TGC GTC CAG AAT TTC AAC AAG CAG CTG ACG 218 
SS Phe SS Gin cys Val Gin Asn Phe Asn Lys Gin Leu Thr 



260 



302 



344 



386 



428 



470 



512 
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CAC CAC TAC GAG TCC CTT CAA ACT GCC AAA AAG AAG GAT GAA 554 
His His Tyr Glu Ser Leu Gin Thr Ala Lys Lys Lys Asp Glu 
155 160 165 

GCC AAA ATT GCC AAG GCC GAG GAG GAG CTC ATC AAA GCC CAG 59 6 

Ala Lys lie Ala Lys Ala Glu Glu Glu Leu lie Lys Ala Gin 

170 175 

AAG GTG TTT GAG GAG ATG AAT GTG GAT CTG CAG GAG GAG CTG 638 
Lys Val Phe Glu Glu Met Asn Val Asp Leu Gin Glu Glu Leu 
180 185 190 

CCG TCC CTG TGG AAC AGC CGC GTA GGT TTC TAC GTC AAC ACG 680 
Pro Ser Leu Trp Asn Ser Arg Val Gly Phe Tyr Val Asn Thr 
195 200 205 

TTC CAG AGC ATC GCG GGC CTG GAG GAA AAC TTC CAC AAG GAG 722 
Phe Gin Ser lie Ala Gly Leu Glu Glu Asn Phe His Lys Glu 
210 215 220 

ATG AGC AAG CTC AAC CAG AAC CTC AAT GAT GTG CTG GTC GGC 764 
Met Ser Lys Leu Asn Gin Asn Leu Asn Asp Val Leu Val Gly 
225 230 235 

CTG GAG AAG CAA CAC GGG AGC AAC ACC TTC ACG GTC AAG GCC 8 06 

Leu Glu Lys Gin His Gly Ser Asn Thr Phe Thr Val Lys Ala 

240 245 

CAG CCC AGA AAG AAA AGT AAA CTG TTT TCG CGG CTG CGC AGA 848 

Gin Pro Arg Lys Lys Ser Lys Leu Phe Ser Arg Leu Arg Arg 

250 255 260 

AAG AAG AAC AGT GAC AAC GCG CCT GCA AAA GGG AAC AAG AGC 890 

Lys Lys Asn Ser Asp Asn Ala Pro Ala Lys Gly Asn Lys Ser 
265 270 275 

CCT TCG CCT CCA GAT GGC TCC CCT GCC GCC ACC CCC GAG ATC 9 32 

Pro Ser Pro Pro Asp Gly Ser Pro Ala Ala Thr Pro Glu lie 
280 285 290 

AGA GTC AAC CAC GAG CCA GAG CCG GCC GGC GGG GCC ACG CCC 974 
Arg Val Asn His Glu Pro Glu Pro Ala Gly Gly Ala Thr Pro 
295 300 305 

GGG GCC ACC CTC CCC AAG TCC CCA TCT CAG CCA GCA GAG GCC 1016 
Gly Ala Thr Leu Pro Lys Ser Pro Ser Gin Pro Ala Glu Ala 

310 315 

TCG GAG GTG GCG GGT GGG ACC CAA CCT GCG GCT GGA GCC CAG 1058 
Ser Glu Val Ala Gly Gly Thr Gin Pro Ala Ala Gly Ala Gin 
320 325 330 
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GAG CCA GGG GAG ACT TCT GCA AGT GAA GCA GCC TCC AGC TCT 1100 
Glu Pro Gly Glu Thr Ser Ala Ser Glu Ala Ala Ser Ser Ser 
335 340 345 

CTT CCT GCT GTC GTG GTG GAG ACC TTC CCA GCA ACT GTG AAT 1142 
Leu Pro Ala Val Val Val Glu Thr Phe Pro Ala Thr Val Asn 
350 355 360 

GGC ACC GTG GAG GGC GGC AGT GGG GCC GGG CGC TTG GAC CTG 1184 
Glv Thr Val Glu Gly Gly Ser Gly Ala Gly Arg Leu Asp Leu 

* 370 3 ' 5 



365 



CCC CCA GGT TTC ATG TTC AAG GTA CAG GCC CAG CAC GAC TAC 122 6 
Pro Pro Gly Phe Met Phe Lys Val Gin Ala Gin His Asp Tyr 

380 385 

ACG GCC ACT GAC ACA GAC GAG CTG CAG CTC AAG GCT GGT GAT 1268 
Thr Ala Thr Asp Thr Asp Glu Leu Gin Leu Lys Ala Gly Asp 
390 395 400 

GTG GTG CTG GTG ATC CCC TTC CAG AAC CCT GAA GAG CAG GAT 1310 
Val val Leu Val He Pro Phe Gin Asn Pro Glu Glu Gin Asp 
405 410 415 

GAA GGC TGG CTC ATG GGC GTG AAG GAG AGC GAC TGG AAC CAG 1352 
Glu Gly Trp Leu Met Gly Val Lys Glu Ser Asp Trp Asn Gin 

4 25 h o \J 



420 



1394 



CAC AAG AAG CTG GAG AAG TGC CGT GGC GTC TTC CCC GAG AAC 
His Lys Lys Leu Glu Lys Cys Arg Gly Val Phe Pro Glu Asn 

435 440 44t> 

TTC ACT GAG AGG GTC CCA TGACGGCGGG GCCCAGGCAG CCTCCGGGCG 144 2 
Phe Thr Glu Arg Val Pro 

450 

TGTGAAGAAC ACCTCCTCCC GAAAAATGTG TGGTTCTTTT TTTTGTTTTG 1492 

TTTTCGTTTT TCATCTTTTG AAG AG C AAAG GGAAATCAAG AGGAGACCCC 154 2 

CAGGCAGAGG GGCGTTCTCC CAAAGTTTAG GTCGTTTTCC AAAGAGCCGC 1592 

GTCCCGGCAA GTCCGGCGGA ATTCACCAGT GTTCCTGAAG CTGCTGTGTC 164 2 

CTCTAGTTGA GTTTCTGGCG CCCCTGCCTG TGCCCGCATG TGTGCCTGGC 1692 

CGCAGGGCGG GGCTGGGGGC TG CCGAG CCA CCATACTTAA CTG AAG CTT C 174 2 

GGCCGCACCA CCCGGGGAAG GGTCCTCTTT TCCTGGCAGC TGCTGTGGGT 1792 
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GGGGCCCAGA CACCAGCCTA GCCTGCTCTG CCCCGCAGAC GGTCTGTGTG 1842 
CTGTTTGAAA ATAAATCTTA GTGTTCAAAA CAAAATGAAA CAAAAAAAAA 1892 
AATGATAAAA ACTCTCAAAA AAACAAGGAA TTC 1925 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 451 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 





(ii) MOLECULE 


TYPE: protein 
















(xi) SEQUENCE 


DESCRIPTION: 


SEQ ID 


NO: 4 : 








Met 
1 


Leu 


Trp 


Asn 


Val 
5 


Val 


Thr 


Ala 


Gly 


Lys 
10 


He 


Ala 


Ser 


Asn 


Val 
15 


Gin 


kys 


Lys 


Leu 


Thr 
20 


Arg 


Ala 


Gin 


Glu 


Lys 
25 


Val 


Leu 


Gin 


Lys 


Leu 
30 


Gly 


Lys 


Ala 


Asp 


Glu 
35 


Thr 


Lys 


Asp 


Glu 


Gin 
40 


Phe 


Glu 


Gin 


Cys 


Val 
45 


Gin 


Asn 


Phe 


Asn 


Lys 
50 


Gin 


Leu 


Thr 


Glu 


Gly 
55 


Thr 


Arg 


Leu 


Gin 


Lys 
60 


Asp 


Leu 


Arg 


Thr 


Tyr 
65 


Leu 


Ala 


Ser 


Val 


Lys 
70 


Ala 


Met 


His 


Glu 


Ala 

75 


Ser 


Lys 


Lys 


Leu 


Asn 
80 


Glu 


Cys 


Leu 


Gin 


Glu 
85 


Val 


Tyr 


Glu 


Pro 


Asp 
90 


Trp 


Pro 


Gly 


Arg 


Asp 
95 


Glu 


Ala 


Asn 


Lys 


He 
100 


Ala 


Glu 


Asn 


Asn 


Asp 
105 


Leu 


Leu 


Trp 


Met 


Asp 
110 


Tyr 


His 


Gin 


Lys 


Leu 
115 


Val 


Asp 


Gin 


Ala 


Leu 
120 


Leu 


Thr 


Met 


Asp 


Thr 
125 


Tyr 


Leu 


Gly 


Gin 


Phe 
130 


Pro 


Asp 


lie 


Lys 


Ser 
135 


Arg 


lie 


Ala 


Lys 


Arg 
140 


Gly 


Arg 


Lys 


Leu 


Val 
145 


Asp 


Tyr 


Asp 


Ser 


Ala 
150 



Arg His His Tyr Glu Ser Leu Gin Thr Ala Lys Lys Lys Asp Glu 

155 160 165 
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Ala Lys He Ala Lys Ala Glu Glu Glu Leu He Lys Ala Gin Lys 

170 I 75 

Val Phe Glu Glu Met Asn Val Asp Leu Gin Glu Glu Leu Pro Ser 

Leu Trp Asn Ser Arg Val Gly Phe Tyr Val Asn Thr Phe Gin Ser 

200 205 

lie Ala Gly Leu Glu Glu Asn Phe His Lys Glu Met Ser Lys Leu 

Asn Gin Asn Leu Asn Asp Val Leu Val Gly Leu Glu Lys Gin His 

230 235 

Gly Ser Asn Thr Phe Thr Val Lys Ala Gin Pro Arg Lys Lys Ser 

245 250 

Lys Leu Phe Ser Arg Leu Arg Arg Lys Lys Asn Ser Asp Asn Ala 

~ - — 265 *• ' u 



260 



Pro Ala Lys Gly Asn Lys Ser Pro Ser Pro Pro Asp Gly Ser Pro 

Ala Ala Thr Pro Glu He Arg Val Asn His Glu Pro Glu Pro Ala 

290 295 

Gly Gly Ala Thr Pro Gly Ala Thr Leu Pro Lys Ser Pro Ser Gin 



305 



Pro Ala Glu Ala Ser Glu Val Ala Gly Gly Thr Gin Pro Ala Ala 

320 325 

Gly Ala Gin Glu Pro Gly Glu Thr Ser Ala Ser Glu Ala Ala Ser 

335 340 

Ser Ser Leu Pro Ala Val Val Val Glu Thr Phe Pro Ala Thr Val 

350 355 

Asn Gly Thr Val Glu Gly Gly Ser Gly Ala Gly Arg Leu Asp Leu 

370 



365 



Pro Pro Gly Phe Met Phe Lys Val Gin Ala Gin His Asp Tyr Thr 

380 385 

Ala Thr Asp Thr Asp Glu Leu Gin Leu Lys Ala Gly Asp Val Val 

395 400 ^ 

Leu Val lie Pro Phe Gin Asn Pro Glu Glu Gin Asp Glu Gly Trp 



410 



415 
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Leu Met Gly Val Lys 

425 

Glu Lys Cys Arg Gly 

440 



Glu Ser Asp Trp Asn 

430 

Val Phe Pro Glu Asn 

445 



Gin His Lys Lys Leu 

435 

Phe Thr Glu Arg Val 

450 



Pro 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

Glu Asp lie Trp Lys Lys Phe Glu Leu Leu Pro Thr Pro Pro Leu 
15 10 15 

Ser 



(2) INFORMATION FOR SEQ ID NO: 6: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14985 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: unsure 

(B) LOCATION: 13 3 2 

(D) OTHER INFORMATION: /note= "unsequenced segment 1 

(ix) FEATURE: 

(A) NAME/KEY: unsure 

(B) LOCATION: 3 225 

(D) OTHER INFORMATION: /note= "unsequenced segment 1 

(ix) FEATURE: 

(A) NAME/KEY: unsure 

(B) LOCATION: 7 2 09 

(D) OTHER INFORMATION: /note= "unsequenced segment' 
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(ix) FEATURE: 

(A) NAME/ KEY : unsure 

(B) LOCATION: 11097 

(D) OTHER INFORMATION: /note= "unsequenced segment 

(ix) FEATURE: 

( A ) NAME / KEY : exon 

(B) LOCATION: 1..324 

(D) OTHER INFORMATION: /note= "Exon 1" 

(ix) FEATURE: 

( A ) NAME/ KEY : exon 

(B) LOCATION: 325.. 1618 

(D) OTHER INFORMATION: /note= "Exon 2" 

(ix) FEATURE: 

(A) NAME/ KEY : exon 

(B) LOCATION: 1619.. 3174 

(D) OTHER INFORMATION: /note= "Exon 3" 

(ix) FEATURE: 

(A) NAME/ KEY : exon 

(B) LOCATION : 3175.. 4365 

(D) OTHER INFORMATION: /note= "Exon 4" 

(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 4441.. 11518 

(D) OTHER INFORMATION: /note- "Exon 5" 

(ix) FEATURE: 

(A) NAME/ KEY : exon 

(B) LOCATION: 11519.. 11850 
(D) OTHER INFORMATION; /note= "Exon 6 



ti 



(ix) FEATURE : 

(A) NAME/ KEY : exqn 

(B) LOCATION: 11851. .12240 

(D) OTHER INFORMATION: /note- "Exon 7" 

(ix) FEATURE: 

(A) NAME/KEY; exon 

(B) LOCATION: 12241. .14129 

(D) OTHER INFORMATION: /note- "Exon 8" 

(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 14130.. 14985 

(D) OTHER INFORMATION : /note= "Exon 9" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
TCGATCTAGC AGGATGAGCT GCCGTCCCTG TGGAACAGGT GAGGCCCGGC 50 

ACGGTGCCCA GCCTGCGTGG GGCAGTGTCC AGTCTGCGTG CTGCAGTGCC 100 

CAGTCTGCGT GCTGTGGTGC CCAGCCTGCG TGCTGCGGTG TCCAGTCTGC 150 

GTGCTGCGGT GCCCAGCCTG TGTGCTGCAG TGTCCAGCCT GAGTGCTGCG 2 00 

GTGCCTGGGC CCTCTCTGGT TTGTGCCTCT GATGAGCGTG TGTGGTCGCT 250 

CGTGGGTGGG TATTTCTGAG TTGCTGTCCT GACCTGCCTG TTCACCTGGC 3 00 

CCCCATCCTT CCGCCCTTCT GTAGCCGCGT AGGTTTCTAC GTCAACACGT 3 50 

TCCAGAGCAT CGCGGGCCTG GAGGAAAACT TCCACAAGGA GATGAGCAAG 4 00 

GTAGGCCATG GGGACCCCTC TGAGGGGCCA CACCCCACCC TGGCCGAGGG 4 50 

TCAGAGTCAG AATCGTGGGA GGGGCAGCCT GAACTCCTCC TTCCCTGCCA 500 

GGTTCAGCAC ACACCGGTGA CCACAGGGCT CCCTCCCGGC CCTGGTGGAA 550 

CAGCCCCCTT CAGGAGTGCC TGGGCCCCAG GAAGGGCACC AGGGCATGCT 600 

GGGGAGGCTT TGAGAGTGTC CTGGGTCCTT GCCTGGGTAG ACCACCTGAG 650 

AATGTAGACC AGGCCCTCTC AAACTGTGAA TGTGTTCTCT AGCAACCTTG 7 00 

GCCCAGGGAG TGCAGTGTTA GACAATGGTG GCAGTTTCTC AATTTGTAGG 7 50 

AAAAAATTAC TAGCATTTTC TCAATTTTAA TTTTTCTCCC ATGCTTTTGA 800 

GACATTTAAA TAGGCCTGTT TCGTCTGCGT GAATCCACTG TGCGACAGGG 850 

TCTGGTCTGA TTAGCGTACC TTTCTCTCTT GTGTTATTTT CATTTTAACC 900 

TATCTGGCAA CTAAAACGCA AAGCTGTTGA ACTTTGCAGT TGGAGAGACC 950 

CAGGGCTGG C ACCCCCGCCC CCAGTGGTGG GTCTGACTTT GTCTCTTGGG 10 00 

CCCAGATGGA TGAAGTGATG ATGGTGGATT CCAGGCCCAG GGTCGGAGGA 1050 

GGAAAAGCAG CTTGGGGCCT CCCCTGACTC ACACTTCTAA GTACGGTTTC 11 00 

CTGCCTTTTA GACTTTCTCT TTCCTTCTTA ACTTTTCCTT TTTCCCTTCG 1150 

AAGATTGGAG ACAACTTACG AAAAGTTTAA AAAATAGAGA AAGGTGTTTC 12 00 

AGAGAAGGAA CATTTATATA AAATTAGTCT GTAAAAATAT ATGCCACAGA 12 50 
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ACATAGTTGT TTATAAATAG GATGAAGATT CGGCCTGAGC TCCTAGTGGC 13 00 

CAAATCAAAG AAANGAATAT CTCAGTAATG ANGGNGTTNA TAATGGGGCA 13 50 

TTTTAGGATA TCCCTAATAA GTAGGGAGGG GAAAAGATGG TTCCAAGAGA 14 00 

CAGCAGGCAT TAGNGATATA GCCCANTTGC CCGGTTTTN G NNGCCCCCTN 1450 

GGTTTTTGGG GGGNTTGGTG TGCNAGTTCC TTGCTCTCAC AGATGGGGAA 1500 

AGAGGAGCTT GTTCCTGGCA GGGG CTGGGG GTGGTGGGGA GAAGCAGAGG 1550 

TGTTTGGGGA AGGTGGGGCC GTTTGGTGGC CTTGGAGGCC CCCCACCTCC 1600 

TCACTGTCTC TCCTGCAGCT CAACCAGAAC CTCAATGATG TGCTGGTCGG 1650 

CCTGGAGAAG CAACACGGGA GCAACACCTT CACGGTCAAG GCCCAGCCCA 1700 

GGTGCGTGCG GGGAGAGCCC TGGCGCCCCT GACTGTGTGC ACGGCAGGGG 1750 

CAGGGCTCCT TCCTGTGACC CTGTTGGTGC CCTCCCCTGG TCCCCCATGG 1800 

GTTTGGCCTT GGGGGTCTAG GGACCTTCCT GTCTTGGCCT CTCTGTGCTC 1850 

AGGGAGGCAG GTGAGGGCAG GTCTCTGTCT CGAATGTCCC TGCCCCTCTG 1900 

GCTGTGTTCG TCGAGGAAGG AGCACTCTGG GGAGTCCGCG GGTACCCTGA 1950 

GCCGGCTGAC GCCCTCATTG TGGAGCACGA GCATCCAGGG TTGGGGTGGG 2000 

CAGCCTGCTC AGCTTTGGGG ACTGGGGGGT GTGAACAGGA CTGAAAGACT 2050 

CCGGGGTGTG CAGTCCTCTC AGAGCAGGGA GATAGCACCG CCCTTCCTCT 2100 

CCTGCTNGTG GNAAAAGATC ATGTCCCTGG ATGG C AG CAT TGTGCTCAAC 2150 

CACANGAGCA TCCTCTTCCT GTCCTCAGCC TCAGCCCCTC CGGGAATCCC 2200 

AGCTGCAAGG AGGCCTCTGT TTCCTGAGGG GAAACCATGA GGGAGGAGGG 22 50 

AAATGCCTTG CTTTCCTGGC TGTGGATCAG AGGAAGCAGC GAGCCTGGGA 23 00 

CTTCCCCTCC CTTNTGGCCA TGTGTGCATG TGTGTGTGTG AGGGGGACTG 2 3 50 

TGTGTGACAG GTGTAAGTGT GTGCATACCC ACACACATAT CACAGCAGAA 2 4 00 

CGCAGAGAAC ACCGATGGAC TCTGTAAAAC AGGGCGACTG TCTGCTTCTT 2 4 5< 

GGGGTATTGC CTGGGATGAT GAGGGTATCG GGTGGTGGTG ATTGCCCCCT 2 50 

CCTTCCCTGA ACATAAAATA GTTGTGGCTG AGAGAGGGGC CATGGTGACC 2 55 
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TGAGGCTGGG AGTGGGGAGG TTAGGACGGT 
GGGGGTGGGT AGGGGGGTGG GGGTTGGGAT 
AGGTCGGGCA GGGAGGGACA GCTGCCTGGC 
ANTGGGATGG ATCAGCAGGG GGTAAGTGGG 
GGCTCCCCTC AGGAAGGAGG TGGAGGGAAG 
CAGGAGTCTC CCTTCCAGCA GAGAGGCCTC 
TGGGGGCCTT TTCTGGCACT NTGGGCACAC 
GACCACAGGC AGGGTGACTA TCCACTGCCC 
AGGCCCTGTT AGCATCACCT CGGGCACCTG 
GGGCACCCCG GGATAGCACG CCCAGGCCCT 
AGGAGAGGCT TTTGCCCCTT TGTCCTCTGA 
GGAAGGGGAC AGGCCAGTTC TGCACCTGGC 
TCCACCTCCC GTCCGTCCCC ACAGAAAGAA 
TG CG CAG AAA GAAGAACAGG TACCNAGGAT 
GGGGCAATTT AAGCAAATGG AGGAAGGNGG 
GGGAGGGCCA GGGAGGGTGA NAATGCAGCA 
TTTAAAACAA ATTGTATGTG TATACCATAT 
AGGAGAATAC ATTCCCATAA AACACAAATT 
AGCGACATCT CTTACGNTGT TCCACTGTTT 
CTGGTTCCTG CTGGATGNTT GTAGATGTGC 
GTTACCCCAT GGGGCTATTG AGCACTTGAA 
AGCGGAACTT TTCATTTGAA TTACACATAA 
CATTGCCGCC TGTGGCTAGT GGCAACCGTA 
CGTCTCTGTG CAGGTTCTGG TAGAGAATTT 
TGTGCCTGGG GTGCACAGCA TCACACCACC 
TGAACGCCAT GGCTCAGGAC CTGTCCCCTC 



GGCGTTGTGG 


TGGTGGTTGG 


2600 


AAAGCCAAAA 


GGTGAATTCA 


2650 


CTGTAGGCAC 


AGGTGGGAAC 


2700 


GCCGTCCTGG 


C CAG AACCAT 


2750 


AGAGAGGGGC 


AGTAGAGGCC 


2800 


TTGTGCACTN 


TGTGCTCGCC 


2850 


CTGGAGCTCC 


TGGGGACTGG 


2900 


CGAGCCTCCT 


GCCCCTCACC 


2950 


GCCACAGCAG 


GGGCCAGTCA 


3000 


GTGCAAGGCC 


TCTGGCACTT 


3050 


GCAGAAGGGT 


TGGCAAAGAG 


3100 


CTTTCTCCAG 


AATGAAGGCC 


3150 


AAGTAAACTG 


TTTTCGCGGC 


3200 


GGTGCCCCGG 


GGCCAACCCA 


3250 


GNTGGGGAAA 


GGAGGCNTGG 


3300 


AATGTGGGGG 


TTTTTTGTTT 


3350 


ACTTATACAT 


ATATTCTTTA 


3400 


CCAGAAGGAA 


AGATGGTGTC 


3450 


GCCCTCAGGT 


GANTCGGTCA 


3500 


ACTGTCCAGC 


ACAGGAGCCA 


3550 


ACTGGCCAGT 


GTGACCGGGC 


3600 


ATTTCATTGC 


TTTGAGTTTG 


3650 


CTGGGCAGCA 


CTTTTCTAGG 


3700 


TCTCCCTGCA 


CCTTCGCCCC 


3750 


TCCGCCTTGG 


GTTCTGGCAC 


3800 


CATCGCCAGC 


TGCCCACTCC 


3850 
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TCTGTGATGA GGACGCCTCT CTTAGTTTGT CCAGGCCCTG CTTGTGGCCT 3900 

CCAGCAGCCG AGAGGACAGG AGAGCCCAAG GTCTAGAGAC ATGTACCAGG 3950 

GTGCTGTGAT GGACAGGCAG GGAGGGCAGC AGGCTGGGGA GCAGACCCCA 4000 

GAACAGAGGG GCTGCTGCGT GTGGTGTGGG AGACTCACTG TGCCTCTAGG 4050 

ATGTCTGGCT TTCTCCTGCT GTGGATCTTG GGCTGTCAGC ATGGGCCCTG 4100 

GTGGACCCCA TGGAGCCTGT GGGGTGGTTG GTCTGGTCTC TGCGACAGAT 4150 

GGTTCCAAGG GACCTGCCTG CACTCCTGGG G AC CAT AG AC CTCCAGCCTG 4200 

GAGTCCCACC TTGTGCTGTT CCTGTTTCTG AGGCAGGCTT CCCACTTCCA 42 50 

GCCCCCCAAG CCCAGGTCCC TTGGCTCCCC CCACCCTCCT GCTCTCTCTC 43 00 

ACATACACAC ACACACACAC ACACAGTTTC ACACCTCCAT ATGCACACAC 43 50 

CTCTTCACAC AGACGTCAAT ACATTTGCCC CTCCGTCTCC TGTGCCTTGG 4400 

CCCCCCAACA CTGGGCTCCC TTTCTTGTCC TCCCCCACAG TGACAACGCG 44 50 

CCTGCAAAAG GGAACAAGAA CCCTTCCGCC TCCAGATGGC TCCCCTGCCG 4500 

CCACCCCCGA GATCAGAGTC AACCACGACC AGAGCCGGCC GGCGGGGCCA 4550 

CGCCCGGGGC CACCCTCCCC AAGTCCCCAT CTCAGGTAGG GAGNGAAGTT 4600 

TTGGTAGAAG GTCCCAAGCC NTCCATCNAT NTCGT CNGGG ATNGGCTGTT 4650 

GTCCTCCATC CTCCCACTCC CTGTCCCCTT TCTGGCCTGG GCAGCTATGG 47 00 

ACCCGATGCT TTGCCCAGTG GGGGTTGGGC CTGGACCTGG GTGTCCTTTC 4750 

CCCCATCCAG CTGGCATGCT TCCGGGAGGC ATCTGTGACT TGCTCGTTCC 4800 

TCCCCAGCCC CCACCCCCAC TGCAGCGCCC TCCCCTTCCC TCTCCNTGGT 4850 

GTTTTGTGAT GNTNGANTCT TTNTCCATNT TNTTTTCCTC CCTAGCAGAG 4900 

GGTATGGGCC CTCAGCCCAC AGAGCATCCA GATNTCCAGA GTGGG CTGCC 4950 

TGTCCCTNTT NNTGGCCTCT CT CTTTT AAA GGGGCCTGAG GGAGGAGCCC 5000 

AAGCCAGGTG GCCATGCAGG AC CTTT AAAG GGACAGAGAG AGGAAGGGGT 5050 

C AG AGG AG G G TGTGGGGTGG CTAAGGGGGC AGGTCNTGGG NTTGTG G ANT 5100 

GTCCTTGTGT CCACCCCGCC TGCCCCCAGC GGGCCCTCCA GGTGTANGCA 5150 
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GGNCCCTAGG TGCTGGCTGG 
CTGCAGCTCC CGTTTTCTGC 
GCAGGGCTAA GGACATGGTG 
CCTGGACATG GTGCCAGTGC 
AACACCCCTG TGAAGCAGGC 
AN CGTCATGG GTGTGTGACC 
GTGTGGCCTC TGCCCCATCC 
CTCGTCTGAG CTGGGCTCCC 
AAGAGGGGCC TGCGGGAANG 
GCTGTCTCTC AACGACTGTT 
GGAGCCAGCC CCTCCCACAG 
TTCTTCGCTC TCCTTCCTCC 
CCACACAGCG TTTCTGGACC 
CTGGGGTCTN GGTGTCTATG 
TTTGCTGCGT CCTGCTCANT 
TAACCGTGGC TTTGTCT CTG 
TGCCTTCCCG CGCGGCCCCT 
GCCCACCAGT CCCTCCGCCT 
CAGGAGCAGA TCCTCAGCCT 
CGTGACCACC CCCTCCCAGG 
TCTCTTCCTG CCCTCTCAGG 
GTCTCAAGAG CCAGGAATCT 
ACCCTTTGTT CCCTCCATCC 
TCTGAGCATG TACTAAGGCC 
CCTCTCCCGG CGCAGCAGTT 
TGCTATGGCA GAGGGGATCG 



55 

CCAGGNGGGG GAGTTTTCAT 
TGCCGCCCTG CTCTGCTGCT 
GGAGCCTGTC C C AAA C AG C A 
CTT CTGTGT A TTCGTTCACT 
GCTGTCATCA CCTGATGCAT 
TGCCTGAGGT CCCCCACCTG 
TGGTGCCACG CTGGCTTCCT 
TGTGGGCAGC CCTGTGCCCT 
GAGAGGTGGG CAGGGGGAGG 
TGCTTCCCCA GTCTTCTCAC 
TTGGCCAGTG GGCAGCCTGG 
TCTCCCCTCA CTTCTCTATC 
GCCTGCCTCA GTGTCCCTCT 
TTGGGGGGCT GGGAAGGCAN 
GGCCTGGGTG GGATGTGGCT 
TCTGTCTCCC CCAAACCCCG 
CACCCGCCGC CGACCCACAG 
CCCAAACACA CCCCGTCCAA 
GTTTGAGGAC ACGTTTGTCC 
TCAGCCGCGG CCGCCGCGGC 
GCGTGCATGG CCTTCATCCT 
GGCCAGAGAG AGTGTCAGTT 
ATCATCCTCC ATCATCCTCC 
AGATGCAGGG CCGCAGAGGG 
ACATCAGCAG CGCCCTCGCG 
GGGAGTGTGG AGGACTGTGG 



AGCCGGGATC 5200 

GACTAGG ATA 5250 

CTTCCCCCGG 5300 

GAGTCCCCAC 5350 

GAGGAAGCCC 5400 

GTGGGCAGGG 5450 

CTGGG AT A CA 5500 

GGGAGGTGGA 5550 

CTGGGGCCCG 5600 

CAGGCCAGTG 5650 

GGCCTCTCTC 57 00 

TCTTCTCTCT 5750 

CGGGGGTGGC 5800 

TNACTCTTCA 5850 

GAGGTGTGAC 5900 

TGCTCTGCTG 5950 

CTCCGGAAAG 6000 

GGAAGTCAAG 6050 

CTGAGATCAG 6100 

CCAGCTCTCC 6150 

CTATG CTTCT 62 00 

TCCCTCTCTC 6250 

ATCACCCATC 63 00 

GAAGGTGCCG 63 50 

ATGCAGTGGG 64 00 

CTGTC AGGGA 64 50 
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AGGCTTCCAG GGCCAGGGAG AGTTGGAAGG TCCTGGAATG GCTGAAGCAC 6500 

CTGGACTTCA GCTCCCACAG CTGCTGTCAG CCCCTCGAGG GCGGGGGCAG 6550 

CGGCCAGGCT GCAGGGCANA ACTGCCGGTG TGCAACACTC CCTAAGAGGC 6600 

GTGGAATGCC C AG AT AC AG C AGGGAGCCAC CCAGGGGGGC TTGGGTCTCT 6650 

CCCGACGGGC CCTTGGCTCA GCAAGGAGCC ACGCAGAGGG TCTTGGGTCT 6700 

CTCCCAGTGG GCTCTTGGCT CAGCCGTGGA GGTGCCTCTG GGGAGCCCGG 6750 

CCCACAGCCC CAGGTCTTAC GTCCTTCATG GTGGAGGTCG GGCTGGAGTA 6800 

CCTGTGCTGG AAGCGCATCT TGCCAGTGCT GGAGTGGGCT GACGTGTTGT 68 50 

CAGATTTGCC CAGAGGTGGC CGGCCTTCCC CGCACTCCCC GAGAGCTGAC 6900 

TGCCTCCTCA AGGTCCAGCC CTCAAGGCCT CACCTTCCTC CTGTGGGTTA 6950 

GCCAAGAACC TTCCCACACA AACCTCCCCT GTTAGGAAAG CTGTCCATCC 7000 

AAGCTTGTGG TGGCCTCCCA ACAAACACCT TCCACACACT CAAAAACCCT 7050 

ANTGGGGANT AGTTTGGAAG GTTTTAATTT TNGGGAATTT GCCCNCTGGG 7100 

AACTTGCAAA CANTGGTCCC CTGCTAAGAA AGGTTTGGGA NTGGTGGGCC 7150 

TCCAACCCCC TNTGCNAAAA NNTAGGAAAT TAAAACTNAG GAACCNAAGG 72 00 

CNNCCGCCNC TTGCNTTTGT CCATGAAANN NNNNGCCCAC GGGCTTACCC 72 50 

GGNTGTGGGG TGGTGNGTAG CGTGTGTCCN TGACATGGAG GGACNGTCCC 7 3 00 

GGGCCTGCAT GGCGGGGTGC CACCTGCCGG GGCAGCACAG CGAAGGGATG 7350 

GTC AG CTTTT TGGCGGATGA CCCTCCCCTC AG C AC ATG AC GGATATTGCT 74 00 

GCGTGGGTTG GCTGACTTTT ATGAGACAGG AGGGAGGGGT GTTG CTGGGG 74 50 
CAGGGTGGGG GCCACTGGGG AGAGATGCTG GCCGCCCGCT GGTGGGAGGC 
ACCTCGAGGC TGTGCACCGG CGTCCTCAGG GCTCCTTCAG AGACGGCCGG 
TTATGGGGCA GAG CAGTGAC CTCCCGACCC TGGGTTCCCC AGACAGGGCT 
GGACCTAAAG GAAAGT C AG C TG CTGGG ATT GGCCCAGGGC AGGGCTTGGG 
GCCTAGGGCC CCTGGTTCTA GGAAGTGAGT CCACTTGGCC TGAGCTGTCT 

GACACCTTGG CTTGGCCATG TGGANTGCTC CACGCTTGTC CCCTGAGTGC 77 50 



7500 
7550 
7600 
7650 
7700 
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AGGACAGCTG GTCTTCTTAG GACTGAGGAC 
CTTCGGTTCA GNTATGGAGT GCTTNTGGGG 
GTCTCACATC CTTCATGGTG GAGGCCGGGT 
GTCCGGTGCC TGTCAAGAGT GTGTAGAGCC 
TGCGGGGCTG GAGTTCTTCC AGCACTGCCT 
CCCCGGCCAC GTCCCTCTCC TTTTAAACCT 
CCCAAAGGGA AGCCCCAGGT ACAAGATGGA 
CTGCTCCTCA GXAGGCAGCT TGTGCCCTGG 
CTCACCGGAG TCATTGTCCC CGCCTGTGCC 
TTCCTCCCAT GCTAGTCTGG CCCAGCTTAT 
GGCCAGGGCC ACCGTGTCCC ATCCCTGGGG 
TTCCTGGGCT CACTGGAAGG AG AT CTTGG C 
CAGGGGCCTG CAGTTTTAAC CCAAGTGCCC 
GAATGTTGAA GACCCCACTT TGAAGCTTGG 
CTCCACTCTT TCTCCGTCCC TGGAGCTGAC 
AGTGACCTGC CTGTCTGGGG TGGAGGAAAA 
TTTGGAACTT TCCCAGTCGG GAGCACTGAG 
ACTCTCGCCA CTGGGGGGTA GCAAGATCAG 
GCGGAGAGGA GGTTCGGGCA TTGGTAGGGA 
TGGTGGCGGA AGAGAGCTGG GGCCTGGTGC 
CACGGGCGGA CCTATGACTG GGAGTTTGAG 
CCTGGTGTGG GCGGGAGCCT GTGCCGGGGC 
CCCGTGTGTG AGGTGGATGA GTTGGTGCCT 
TGCGTGTATG TGCGCTTGCT CTGTGCATGC 
TGTCCACGCG TGTGCCTGTG CCTGCAGTGT 
CAGCTTAACA CTAACTGCTT CCTCCTCTGC 



CTTGGTNTCT CCCAATGGGC 7800 

AGCCCGGCCC ACAGCCCCAG 7850 

GGGAGGGGGC CCCTGTCAGT 7900 

GGGAAGCCGC TGGCCTGGGC 7950 

GAGGGCCCCG GAGGGGAGCA 8000 

GGGCAAAGTT CTCTCTGGCC 8050 

GACCGCAGCC GAGCCAGTCC 8100 

GCATGGATGC TGCCCCAGGG 8150 

GGGGGCTCTA AGGAAGCCCC 8200 

GGGGAGGCTT GTCCCTGTGT 8250 

CCATGCCTGT CACATGCCTA 8300 

GAGGGGCTGC TGGGAGGGGT 8350 

CGGGTGGTTC TGAAGCCCCC 8400 

CTGTTGGGCT TTGTGGCTGG 8450 

GGCTGGTGGT GTCGCCAGAG 8500 

GCCAGTGTGA AGTCTCTGCC 8550 

GGTGGCTGTG GCATGGTGTT 8600 

CAGAACTCTT GGCGCAGGGA 8650 

GGGGCCCACC AGTCTGTGGA 8700 

TGGCCCTGCG GGGTGGCGGC 8750 

GCGGGCACTG GGGTCGTCCT 8800 

GCGTGGCTTT GGG CAGTGCT 8850 

GGGCTGTGTG CCAGCGTGTG 8900 

GTGGTGTGTG TATGTGTGTG 8950 

CTGCCTGGGG TGAGGGCTCC 9000 

TGCTGCTGCT GCTGCCAAGT 9 050 
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TTGAGGCCCC 
TTTGACCCCC 
TGGTCAGGTT 
GTGCCCAACC 
GCCTGGGTCA 
CTCCTGGTTG 
AGTAGGGGGT 
GCAGAAGCAT 
GCTCGGCCAC 
TGCCTGCTGG 
GACTGCTTGC 
CTCACTGCTG 
CACTCTGCTC 
TCCAGGCCCA 
CCCCCCACAA 
GAAGCCACCC 
TGGAAAAGGG 
GGTCCTGTTT 
GGTCACCCAG 
AGGGTAGAAG 
TTGCGAGGAA 
AGACAGGGTG 
CGAGGCCATC 
ACAGAGGCTC 
AG C C AG AATT 
TACAGGCAGA 



GGGGCTTATC 
TCCCGCCCGT 
GGTTGTGCCC 
CTGGGCTCAT 
CTGCCCTTCC 
TAGGGCAGGA 
TACTGAACAC 
CCTGGCTTGG 
CTGCAAATCT 
GGTGGATGGC 
CGCCCTGTCT 
CGCTCAGTCA 
TTTGTCCACC 
CATGATCATA 
GCAGGACACA 
GCCTCCAGCC 
GATCTTCGGT 
TCACAGCCTG 
GATGGACGAG 
GTTCTTGCCT 
GGGGAGAGGG 
TGGCTGAGTG 
TCACACAGCA 
AGGGAAGGCT 
CCAACAGGGT 
GATG CTGGG A 



58 

TCGGAGCAGG 
GACGAGCCCT 
ACCACTGCCC 
GTTGCCTATT 
CCTGGCACCT 
AGCAGCCCCT 
TTCCTGGCAA 
GGTCTTGAAG 
TGGAGTCTAC 
CTGGGGCGAG 
CTAACCTCTG 
ATT CCATGGG 
CCCTGGGGGA 
GCCTGTTCAC 
CAGGCACGTG 
ATTNTGNTGC 
TTAGCTTGAG 
TGAGTGTTTG 
TGTCCTCAGG 
TCTCTGAGCT 
GAGGAATGGA 
CCTTCTGAGA 
CCATGTCACT 
CAGTACTTGC 
CTCCTGGGCT 
GGGGCAGGCG 



CCAGTCTGCT 
GTGAAGGCAC 
ATGGGCCCAC 
GGCCACGTGA 
CAGCCTTCAG 
GATCAGCTGG 
CTTTGTGCTC 
CTCCCTGAGA 
CTGG CTCCGA 
CGGGGGTAGG 
TGCTAACTGT 
ACCTCTGGGA 
ACCACTCTTT 
AGGTGCATGC 
CTCACGCACA 
TTCTCCCTCT 
ACAGGGGTCC 
CCTCCAGACA 
GTGTGGGGCA 
TTTTGGCAGT 
TGGTTTGAGA 
GCAGGGCCTG 
GTCACCTGAT 
CCAAGAACTG 
CTGTCCCTGA 
GGTGTGCAGG 



GGACCTGGAC 
CCACGCCCTC 
CAGCTTCCAG 
CCCCAGCTAG 
CCCTCATCAC 
GAGAACTCTC 
ATCGCTTGGG 
GGTGTCGGGA 
GCCACTCCTG 
GTCCCCTGGG 
CCTTCTCGCC 
GGTTAAGCTG 
CCCGTATGTG 
ACCCCACACA 
GGGAGNTGGT 
GGCAGGCCCT 
CCTGAGATCT 
GAAATGGGCC 
GGAGGGCCTC 
GGGGAG CTGT 
GAATCAGGGA 
CAGGCAGGTG 
AGCTCAGGAC 
GTCATGGTAG 
GACCCCCTGA 
CGC CCTTGGG 



9100 
9150 
9200 
9250 
9300 
9350 
9400 
9450 
9500 
9550 
9600 
9650 
9700 
9750 
9800 
9850 
9900 
9950 
10000 
10050 
10100 
10150 
10200 
10250 
10300 
10350 
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GCATGCGCTG 


GCAGCCCAGG 


CTCCTGGGAG 


CTCTGGAGGC 


TCCACCGCAG 


10400 


GATTTCCCTC 


TGGAGGAAGC 


CAGAAAGAGC 


CAGCCTGGTG 


CG AG CTGGTA 


10450 


GGGCCATTTT 


GACAAGTGGA 


TTTCGGTAGG 


TGCTGAGCTT 


GGGCAGCACA 


10500 


GTCACACCTG 


CCTGTCCCTT 


TGACAGTGGT 


AGGAGAGAGG 


ATGTGGGAGG 


10550 


CGGGTGGCTG 


GCCGGGCTCC 


GCTGGTACCC 


ACCCTGCCCC 


CACCAACCCC 


10600 


AGCCGCTGGT 


GACATTTTCT 


CTTGTCTTGT 


GATCCTGCCC 


ATTGCCTTTC 


10650 


CACCCCGGCC 


TCCCCGCCCC 


CTCCCTGTTC 


TCTCCTCGTG 


GCCTGTTACC 


10700 


AGCCCACAGA 


GAGTCCAGCC 


GGCAGCCTGC 


CTTCCGGGGA 


GCCCAGCGCT 


10750 


GCCGAGGGCA 


CCTTTGCTGT 


GTCCTGGCCC 


AGCCAGACGG 


CCGAGCCGGG 


10800 


GCCTGCCCAA 


GTAAGTGCCC 


ACCTCCAGCC 


CCTGTCTGGC 


TTGTCCCCAG 


10850 


TCTCTAGGGG 


TGCAGCATGG 


AAGGAGAGCC 


CCGAGGAGGG 


GTTGCAGGAG 


10900 


GGACCAGGCC 


ACCATGGATG 


TGAGGGTGAG 


GACAGGGTCC 


TG AG CTAGGC 


10950 


TGCCCCAGCA 


CGGGCTTGTC 


ACCAAGGCTG 


CCAAGGATGA 


ATGAGCGCAC 


11000 


TGGGCGCATC 


AGCCCCTCCT 


GCTTGCCCAC 


CCCAGCCCAA 


CCTCCCACGC 


11050 


AGGAAGACAT 


TTAGGAACAC 


CTACTGGTTT 


ATGCCAGCAC 


TTTCCANGTG 


11100 


TTGTGTCCCC 


CTGCCATGGA 


TTATNTATAG 


GTGCAGCAAG 


ATCTTGCCAC 


11150 


CTGCCGGTCA 


AGCAGGGTGG 


GCGGGCGGTG 


GCTGTGGTGG 


GCGAGGTCTT 


11200 


GGTGCCGAGA 


GAGCAGGGCC 


TGTGAGGCGG 


GGTTGGGGGT 


GGCACTATGG 


11250 


GGCTTGCACT 


GGGTTCTTCA 


CAGCATTGTC 


ACTCACATCC 


TTGGGCNTGC 


11300 


CAGCGCNTAC 


TATTCAGCTG 


CTTCCCCGGC 


CCAGGGCCCA 


GCTTGTCCAG 


11350 


CAGAGGCTCC 


CNTGGATTNT 


TCGAGGCACT 


GGGCAGCTCT 


AGACCNTGCT 


11400 


GCCAGCCAGG 


CGATGCCCCC 


GGCCCTGTTG 


CTTGGGTGCT 


GCCCTCCTGT 


11450 


GGCCTGTTTC 


CTGTGTCCTG 


GCTGTGTCCT 


GTCCTGTGTC 


TGACCCCAAG 


11500 


CCGGCATTTA 


TGTTGCAGCC 


AG C AG AGGCC 


TCGGAGGTGG 


CGGGTGGGAC 


11550 


CCAACCTGCG 


GCTGGAGCCC 


AGGAGCCAGG 


GGAGACGGCG 


GCAAGTGAAG 


11600 


CAGCCTCCGT 


AAG AC AG C AG 


GGACAAAGCC 


CTGCCTTTTC 


CTCCCTGCCG 


11650 



WO 96/34627 



PCT/US96/06231 



60 

CCCGCCTGCC TGTCCGGGGC TCCCCTGTGG CCCCTGATGG TGCTGGTCCA li7 0O 

GGCCTGGCTC CTGTTGAGGA AGCTGGAGGC GGGCCGGTCT GGCACCAGGC 11750 

GCAGACACCT TTCTCCCCTC CCCGCCCCTC TTCTCCTCGG TGGCCCTGGC 11800 

TGT CCTTGGA CCACCTTCCC TGCTCAGCTG ACCCGTACCT CTGCCACCAG 11850 

AGCTCTCTTC CTGCTGTCGT GGTGGAGACC TTCCCAGCAA CTGTGAATGG 11900 

CAC CGTGG AG GGCGGCTGTG GGGCCGGGCG CTTGGACCTG CCCCCAGGTT 11950 

TCATGTTCAA GGTGAGCCCA CAGCCTCTGA CTGCTGCAGT CCCTCGGTGC 12000 

CCTGGTGGGC AGATGACAAC CCTGAGCCTC AGGAGACTCT GTGGTTTGCC 12050 

CAAAGTTGTG CAGGCGCTAC TAGGTCACTC CCAGCCAGCA AGGTGGCATC 12100 

TGANCCCCAT AGAGTCCTGC TGCTTTTGAG CACTCCTGGT CTCCATACTG 12150 

CCACCTGCAC CTCCCACACG CAAGGCCCGT GCTCTGTGCA GGGCTGGAGG 12200 

TGGGACGGAA GGTCTGACTT GCGATCCGCA TCCTCTGCAG GTACAGGCCC 12250 

ACCACGACTA CACGGCCACT GACACAGACG AGCTGCAGCT CAAGGCTGGT 12300 

GATGTGGTGC TGGTGATCCC CTTCCACAAC CCTGAAGAGC AGGTGAGGGC 12350 

TGGGTGGGGC CCCCACACCN CANGGGGACC ACCNNGCATC CTGGCTGCGG 12400 

CTGGCACCNC CGTNGCGGAT ACNCGCCATT CAGGGGGCAG CAGAGGCCCG 124 50 

CGAGCACCAG GGCTCCCGCG CCAACTGCTC CTCCCCGCCC TCCACGTCGG 12 500 

GCTTTTTCCT CTCTCCCTCT CCTCTCCCTT CCCTTGGCCC CTCTCCTGTT 12550 

AGGCCTCTCT CTCTCCCTGT CCCCATACCC GCTTCTTCCT GTAGCCTCTG 12 600 

CTTTCTTCTC CCCACGTCCC CCCTTTGCTC AGGCGCTCTC AGCTCTGCCT 12 650 

CTGTCTCTCC CCTTCTCCTC TCCTGGCAGC TGTGC CTG AG GCCTGCCTCC 12700 

CTCCTGGGAC AGGATGCTTG ACCCCTCCTG CCCCGCCCAC AAGGTGCCCA 12750 

CCCTGCAGCC AG CCGG AG C A CTGGTTGGGC TCATGAAGCC CCGTGTGCCG 12800 

TCCCTCGAGG CGGGCCCTGC CCTGTGCACN CAGGGCCATG GGCTTCCCAG 12850 

CTGTGTCCCC GGCTGAGGCT CACCCACGAT GCCTTCCAGA CCCTTCTCCT 12900 

CCTGCTGTGG CTTCATGTTA ATCTCCTGGA AGTGAGGGCT CCTGTTGAGC 129 50 
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CTGGGTGGGT GCTAAGTGTG 
TGGGTCAGTT TGCCCCTCCC 
TTCCCCGNTG TGTAGAACCN 
TGTGGGCATC TGGTTCTTCC 
GCACAGAGGG CCTGGCCGGT 
ATCAACGCTT CTGTGTGAAT 
GGTCTGCCTG GGTCTTGGAT 
GGATGTAGGG GTACCAAGTA 
TGATGAGGAC CTTGTGGGCC 
CCCAGGTCTT CATGGACCTC 
T C AAGC AG AC TGCTACCCAG 
TGGAGGGGGC TCAGGGCATG 
ACAGAGGGAA ATAACAAAGA 
CAGGGGCAGC CTCGGCCCAG 
CCCCGTGACC CACCTGGGGC 
CCTGCGGGTG TCAGGATTGA 
GGTGGGATGG CCCCAATCCC 
AACTTCGTGG GGTTGTTTGA 
CACATCATGC ATGCAGTGGG 
TCCACCTGCG GACTCACAGG 
CTCAGCCAGC AN CAT C AG G G 
GGGCTTGGCA GGCTGAGGCA 
GCAGGTTGGA GTGGGATGCG 
CCTGTGCGAC CTGNTGCTCT 
GTGAAGGAGA GCGACTGGAA 
CGTCTTCCCC GAGAACTTCA 
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TCCCTCCTAA GTCTTGGGAC 
CAGGGGGCCT TGGAATNATN 
AGCTTTGNTT GTGGGGGGTC 
ACGGTTC AG C CCCTGAGCAC 
TATTCCTGCT TCCAGAGAAC 
AGGTTATCAG AGCGGCTGAG 
GAGGCCGACC NTACTGGGGG 
CTTACTGAGG TCCGGGGCAG 
TGGCACTGAT TTGGCCCTTT 
CTAGTGGGCC AGCCCTGGCT 
AGCCCACAGT GAGAATTGGC 
AGTAGGGTCT GTGACCAGGC 
CCCAGGTAGG CCCCAGGCAC 
CCACTGGCAG GAGTGGATGG 
CAGGGGCCTG TCAGCACTCC 
AG C AAAGGG C AAGTGGAAGT 
TCTAGAATTG TAACTTGTTG 
NAAGCCTGNA ATCCTGGAAG 
ACTCATCAAA ACCAGCCACG 
CTGGCTCCTG TGGTGCCTCT 
AGTGCTGCCT GGAGGAGGTG 
CCAACAGCAG GAG G AGGGG C 
TGCCCTGTGG GGTGGANCCC 
GCCCCTCAGG ATGAAGGCTG 
CCAGC ACAAG AAGCTGGAGA 
CTGAGAGGGT CCCATGACGG 



CTCCTGGATC 13 000 

GGCAAGGAGC 13 050 

GGTGGTGCCA 13100 

NTCGGGCTGT 13150 

ATGTTTAGCC 132 00 

GGTGACAGTG 13 250 

TCCTGGGCTG 13 3 0 0 

GAGGCCTGAG 13 3 50 

CTCNTAAGCC 134 00 

GGGTAGGATT 13450 

CTGGGGNTGC 13 500 

TGACAATGAC 13 550 

AGCCCAGCTG 13 600 

CCATACGGCT 13 650 

C AG AG AAGGC 13700 

TGGAGGGACT 13 750 

TCACTCCCAA 13800 

GGCTG ATGTG 13 8 50 

AATGGTTAGA ,13 900 

GGGCAGGAGC 13 950 

TTCTCAAGGT 14 000 

CGTCTTCCCA 14050 

CTTGCTCATC 14100 

GCTCATGGGC 14150 

AGTGCCGTGG 142 00 

CGGGGCCCAG 14250 
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GCAGCCTCCG GGCGTGTGAA GAACACCTCC 
TTTTTTTTGT TTTGTTTTCG TTTTTCATCT 
CAAGAGGAGA CCCCCAGGCA GAGGGGCGTT 
TTCCAAAGAG CCGCGTCCCG GCAAGTCCGG 
GAAGCTGCTG TGTCCTCTAG TTGAGTTTCT 
CATGTGTGCC TGGCCGCAGG GCGGGGCTGG 
TTAACTGAAG CTTCGGCCGC ACCACCCGGG 
CAGCTGCTGT GGGTGGGGCC CAGACACCAG 
AGACGGTCTG TGTGCTGTTT GAAAATAAAT 
GAAACAAAAA AAAAATGATA AAAACT CTCA 
TCTCCCTCTT CTTGTCCGTG AGTGCGGATG 
TTCTTACTGA GATGGTCTGC CCCCGAAGGC 
CACCACAGGG CTTCACCCCC TGTCCCCTGG 
TGGANGTCAN GGACTGGGGG CTTGGGTTAA 
TTGTCCCANA AATCTTGCTN ACTGCCCCCC 



TCCCGAAAAA 


TGTGTGGTTC 


14300 


TTTGAAGAGC 


AAAGGGAAAT 


14350 


CTCCCAAAGT 


TTAGGTCGTT 


14400 


CGGAATTCAC 


CAGTGTTCCT 


14450 


GGCGCCCCTG 


CCTGTGCCCG 


14500 


GGG CTGCCGA 


GCCACCATAC 


14550 


GAAGGGTCCT 


CTTTTCCTGG 


14600 


CCTAGCCTGC 


TCTGCCCCGC 


14650 


CTTAGTGTTC 


AAAACAAAAT 


14700 


GAAAACGTGT 


GTGTATTTGT 


14750 


GAACCGTGTN 


AT CTGTGGCT 


14800 


CCGCTGCCCI 


GNCGCTGGTG 


14850 


GGTTCTTAGG 


GGTGGTCACC 


14900 


GGGGCTTGGC 


CACCCATCTC 


14950 


TAACT 




14985 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Phe Thr Arg His Pro Pro Val Leu Thr Pro Pro Asp Gin 
1 5 1° 

lie 
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WHAT IS CLAIMED IS: 

1. A mammalian nucleic acid sequence encoding a 
Box-dependent myc-interacting polypeptide 1 (Binl) or a 
fragment thereof, isolated from cellular materials with 
which it is naturally associated, selected from the group 
consisting of: 

(a) SEQ ID NO: 1; 

(b) SEQ ID NO: 3; 

(c) SEQ ID NO: 6; 

(d) a sequence which hybridizes to (a) - (c) 
under stringent conditions ; 

(e) an allelic variation of (a) - (c) ; and 

(f ) a fragment of (a) - (c) . 

2 . The sequence according to claim 1 which encodes 
murine Binl SEQ ID NO: 2 or a fragment thereof. 

3 . The sequence according to claim 1 which encodes 
human Binl SEQ ID NO: 4 or a fragment thereof. 

4 • The sequence according to claim 1 wherein the 
fragment is selected from the group consisting of the SH3 
domain, about nucleotides 891 to about 1412 of SEQ ID NO: 
3; the NLS, about nucleotides 813-854 of SEQ ID NO: 3; 
and the MBD, about nucleotides 867-908 of SEQ ID NO: 3. 

5. A partial murine cDNA sequence SEQ ID N0:1 
which encodes a Box-dependent myc-interacting 
polypeptide. 

6. A human cDNA sequence SEQ ID NO: 3 which encodes 
a Box-dependent myc-interacting polypeptide. 
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7. A human genomic DNA sequence SEQ ID NO: 6 which 
encodes a Box-dependent myc-interacting polypeptide. 

8. A mammalian Box-dependent myc-interacting 
polypeptide Binl, said polypeptide having the amino acid 
sequence selected from the group consisting of: 

(a) murine Binl, SEQ ID NO: 2; 

(b) human Binl, SEQ ID NO: 4; 

(c) a fragment of (a) or (b) having Binl 
biological activity; and 

(d) analogues of (a) or (b) characterized by 
having at least 90% homology with SEQ ID NO: 2 or SEQ ID 
NO : 4 . 

9. The polypeptide according to claim 8, wherein 
the fragment of SEQ ID NO: 4 is selected from the group 
consisting of: 



(a) 


amino 


acids 


278-451 Of SEQ 


ID NO: 4; 


(b) 


amino 


acids 


270-383 Of SEQ 


ID NO: 4; 


(c) 


amino 


acids 


252-265 of SEQ 


ID NO: 4; 


(d) 


amino 


acids 


190-250 of SEQ 


ID NO: 4; 


(e) 


amino 


acids 


263-397 of SEQ 


ID NO: 4; 


"(f) 


amino 


acids 


223-251 Of SEQ 


ID NO: 4; 


(g) 


amino 


acids 


1-222 of SEQ ID NO: 4; and 


(h) 


smaller fragments of (a) - 


(g) comprising 



about 8 amino acids. 

10 . A vector comprising a mammalian nucleic acid 
sequence encoding a Box-dependent myc-interacting 
polypeptide (Binl) under the control of suitable 
regulatory sequences . 

11. The vector according to claim 10 wherein said 
vector is an expression vector. 
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12. The vector according to claim 10 wherein said 
vector is a gene therapy vector. 

13 • A host cell transformed with the vector 
according to claim 10. 

14. An oligonucleotide probe comprising a nucleic 
acid sequence selected from the group consisting of: 

(a) SEQ ID NO:l; 

(b) SEQ ID NO: 3; 

(c) SEQ ID NO: 6; 

(d) a nucleic acid fragment of (a) - (c) 
comprising at least 15 nucleotides in length, 

and a detectable label which is associated with 
said sequence ♦ 

15. An antibody raised against a Box-dependent myc- 
interacting peptide (Binl) , said peptide selected from 
the group consisting of : 

(a) SEQ ID NO:2; 

(b) SEQ ID NO:4 ; 

(c) amino acids 190-250 of SEQ ID NO: 4; 

(d) amino acids 252-261 of SEQ ID NO: 4; 

(e) amino acids 263-397 of SEQ ID NO: 4; and 

(f) a fragment of (a) to (d) comprising about 8 
amino acids. 

16. The antibody according to claim 15 which 
recognizes an epitope in the region of amino acids 19 O - 
250 of SEQ ID NO:4 . 

17. The antibody according to claim 16 which 
recognizes a complex epitope in the regions of amino 
acids 190 - 250 and amino acids 263 - 397. 
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18. The antibody according to claim 15, selected 
from the group consisting of a chimeric antibody, a 
humanized antibody, a monoclonal antibody and a 
polyclonal antibody . 

19. An anti-idiotype antibody specific for the 
antibody of claim 15. 

20. A diagnostic reagent comprising the antibody 
according to claim 15 and a detectable label* 

21. A therapeutic composition comprising the vector 
according to claim 12 and a pharmaceutical ly acceptable 
carrier. 

22. A therapeutic composition comprising the anti- 
idiotype antibody according to claim 19 and a 
pharmaceutical^ acceptable carrier - 

23. A therapeutic composition comprising a 
pharmaceutical^ acceptable carrier and a mammalian Box- 
dependent my c- interacting polypeptide (Binl) selected 
from the group consisting of: 

(a) SEQ ID NO: 2; 

(b) SEQ ID NO: 4; and 

(c) a fragment of (a) or (b) having Binl 
biological activity . 

24. A method of detecting a cancer involving the c- 
mvc oncogene or a hyperplastic disease state comprising 
providing a biopsy sample from a patient suspected of 
having said cancer or disease and incubating said sample 
in the presence of a diagnostic reagent according to 
claim 20 or an oligonucleotide probe according to claim 
14 . 
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25. A method of detecting a deficiency in Box- 
dependent myc-interacting peptide in a patient comprising 
providing a sample from a patient suspected of having 
said deficiency and incubating said sample in the 
presence of a diagnostic reagent according to claim 20, 

26. A method of detecting a deficiency in Box- 
dependent myc-interacting peptide in a patient comprising 
providing a sample from a patient suspected of having 
said deficiency and performing the polymerase chain 
reaction using the oligonucleotide probe according to 
claim 14. 

27. The use of the mammalian nucleic acid sequence 
according to claim 1 or the polypeptide sequence 
according to claim 8 in the preparation of a medicament 
for treating deficiencies in Box-dependent myc- 
interacting peptide in a patient. 

28. The use of the mammalian nucleic acid sequence 
according to claim 1 or the polypeptide sequence 
according to claim 8 in the preparation of a medicament 
for treating a cancer or hyperplastic disease state 
involving the c- mvc oncogene. 
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FIGURE 1 



Partial Mouse MIP99 cDNA and Polypeptide 
SEQ ID NOS. 1 and 2 

GAG ATC AGA GTG AAC CAT GAG CCA GAG CCG GCC AGT GGG GCC TCA 45 
Glu lie Arg Val Asn His Glu Pro Glu Pro Ala Ser Gly Ala Ser 

5 10 15 



1 



CCC GGG GCT GCC ATC CCC AAG TCC CCA TCT CAG CCA GCA GAG GCC 
Pro Gly Ala Ala lie Pro Lys Ser Pro Ser Gin Pro Ala Glu Ala 

20 25 30 



90 



TCC GAG GTG GTG GGT GGA GCC CAG GAG CCA GGG GAG ACA GCA GCC 135 
Ser Glu Val Val Gly Gly Ala Gin Glu Pro Gly Glu Thr Ala Ala 

35 40 45 

AGT GAA GCA ACC TCC AGC TCT CTT CCG GCT GTG GTG GTG GAG ACC 180 
Ser Glu Ala Thr Ser Ser Ser Leu Pro Ala Val Val Val Glu Thr 

50 55 60 

TTC TCC GCA ACT GTG AAT GGG GCG GTG GAG GGC AGC GCT GGG ACT 225 
Phe Ser Ala Thr Val Asn Gly Ala Val Glu Gly Ser Ala Gly Thr 

65 70 75 

GGA CGC TTG GAC CTG CCC CCG GGA TTC ATG TTC AAG GTT CAA GCC 270 
Gly Arq Leu Asp Leu Pro Pro Gly Phe Met Phe Lys Val Gin Ala 

80 85 90 

CAG CAT GAT TAC ACG GCC ACT GAC ACT GAT GAG CTG CAA CTC AAA 315 
Gin His Asp Tyr Thr Ala Thr Asp Thr Asp Glu Leu Gin Leu Lys 

95 100 105 

GCT GGC GAT GTG GTG TTG GTG ATT CCT TTC CAG AAC CCA GAG GAG 360 
Ala Glv Asp Val Val Leu Val He Pro Phe Gin Asn Pro Glu Glu 

110 H5 120 

CAG GAT GAA GGC TGG CTC ATG GGT GTG AAG GAG AGC GAC TGA 4 02 

Gin Asp Glu Gly Trp Leu Met Gly Val Lys Glu Ser Asp 

125 130 



SUBSTITUTE SHEET PLE 26) 
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FIGURE 2A 



Human MIP99 cDNA and Polypeptide 
SEQ ID NOS. 3 and 4 



GAATTCCGTG CTGGTTGAGC TTGCTCATCT CCTTGTGGAA GTTTTCCTCC 50 

AGGCCCGCG ATG CTC TGG AAC GTG GTG ACG GCG GGA AAG ATC GCC 95 
Met Leu Trp Asn Val Val Thr Ala Gly Lys lie Ala 
15 lO 

AGC AAC GTG CAG AAG AAG CTC ACC CGC GCG CAG GAG AAG GTT CTC 140 
Ser Asn Val Gin Lys Lys Leu Thr Arg Ala Gin Glu Lys Val Leu 
15 20 25 

CAG AAG CTG GGG AAG GCA GAT GAG ACC AAG GAT GAG CAG TTT GAG 185 
Gin Lys Leu Gly Lys Ala Asp Glu Thr Lys Asp Glu Gin Phe Glu 
30 35 40 

CAG TGC GTC CAG AAT TTC AAC AAG CAG CTG ACG GAG GGC ACC CGG 230 
Gin Cys Val Gin Asn Phe Asn Lys Gin Leu Thr Glu Gly Thr Arg 
45 50 55 

CTG CAG AAG GAT CTC CGG ACC TAC CTG GCC TCC GTC AAA GCC ATG 275 
Leu Gin Lys Asp Leu Arg Thr Tyr Leu Ala Ser Val Lys Ala Met 
60 65 70 

CAC GAG GCT TCC AAG AAG CTG AAT GAG TGT CTG CAG GAG GTG TAT 320 
His Glu Ala Ser Lys Lys Leu Asn Glu Cys Leu Gin Glu Val Tyr 
75 80 85 

GAG CCC GAT TGG CCC GGC AGG GAT GAG GCA AAC AAG ATC GCA GAG 365 
Glu Pro Asp Trp Pro Gly Arg Asp Glu Ala Asn Lys lie Ala Glu 
90 95 100 

AAC AAC GAC CTG CTG TGG ATG GAT TAC CAC CAG AAG CTG GTG GAC 410 
Asn Asn Asp Leu Leu Trp Met Asp Tyr His Gin Lys Leu Val Asp 
105 110 115 

CAG GCG CTG CTG ACC ATG GAC ACG TAC CTG GGC CAG TTC CCC GAC 455 
Gin Ala Leu Leu Thr Met Asp Thr Tyr Leu Gly Gin Phe Pro Asp 
120 125 130 

ATC AAG TCA CGC ATT GCC AAG CGG GGG CGC AAG CTG GTG GAC TAC 500 
lie Lys Ser Arg lie Ala Lys Arg Gly Arg Lys Leu Val Asp Tyr 
135 140 145 

GAC AGT GCC CGG CAC CAC TAC GAG TCC CTT CAA ACT GCC AAA AAG 545 
Asp Ser Ala Arg His His Tyr Glu Ser Leu Gin Thr Ala Lys Lys 
150 155 160 
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FIGURE 2B 

AAG GAT GAA GCC AAA ATT GCC AAG GCC GAG GAG GAG CTC ATC AAA 590 
Lys Asp Glu Ala Lys lie Ala Lys Ala Glu Glu Glu Leu lie Lys 
165 170 175 

GCC CAG AAG GTG TTT GAG GAG ATG AAT GTG GAT CTG CAG GAG GAG 635 
Ala Gin Lys Val Phe Glu Glu Met Asn Val Asp Leu Gin Glu Glu 
180 185 190 

CTG CCG TCC CTG TGG AAC AGC CGC GTA GGT TTC TAC GTC AAC ACG 680 
Leu Pro Ser Leu Trp Asn Ser Arg Val Gly Phe Tyr Val Asn Thr 
195 200 205 

TTC CAG AGC ATC GCG GGC CTG GAG GAA AAC TTC CAC AAG GAG ATG 725 
Phe Gin Ser lie Ala Gly Leu Glu Glu Asn Phe His Lys Glu Met 
210 215 220 

AGC AAG CTC AAC CAG AAC CTC AAT GAT GTG CTG GTC GGC CTG GAG 770 
Ser Lys Leu Asn Gin Asn Leu Asn Asp Val Leu Val Gly Leu Glu 
225 230 235 

AAG CAA CAC GGG AGC AAC ACC TTC ACG GTC AAG GCC CAG CCC AGA 815 
Lys Gin His Gly Ser Asn Thr Phe Thr Val Lys Ala Gin Pro Arg 
240 245 250 

AAG AAA AGT AAA CTG TTT TCG CGG CTG CGC AGA AAG AAG AAC AGT 860 
Lys Lys Ser Lys Leu Phe Ser Arg Leu Arg Arg Lys Lys Asn Ser 
255 260 265 

GAC AAC GCG CCT GCA AAA GGG AAC AAG AGC CCT TCG CCT CCA GAT 905 
Asp Asn Ala Pro Ala Lys Gly Asn Lys Ser Pro Ser Pro Pro Asp 
270 275 280 

GGC TCC CCT GCC GCC ACC CCC GAG ATC AGA GTC AAC CAC GAG CCA 950 
Gly Ser Pro Ala Ala Thr Pro Glu lie Arg Val Asn His Glu Pro 
285 290 295 

GAG CCG GCC GGC GGG GCC ACG CCC GGG GCC ACC CTC CCC AAG TCC 995 
Glu Pro Ala Gly Gly Ala Thr Pro Gly Ala Thr Leu Pro Lys Ser 
300 305 310 

CCA TCT CAG CCA GCA GAG GCC TCG GAG GTG GCG GGT GGG ACC CAA 1040 
Pro Ser Gin Pro Ala Glu Ala Ser Glu Val Ala Gly Gly Thr Gin 
315 320 325 

CCT GCG GCT GGA GCC CAG GAG CCA GGG GAG ACT TCT GCA AGT GAA 1085 
Pro Ala Ala Gly Ala Gin Glu Pro Gly Glu Thr Ser Ala Ser Glu 
330 335 340 
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FIGURE 2C 



GCA GCC TCC AGC TCT CTT CCT GCT GTC GTG GTG GAG ACC TTC CCA 1130 
Ala Ala Ser Ser Ser Leu Pro Ala Val Val Val Glu Thr Phe Pro 
345 350 355 

GCA ACT GTG AAT GGC ACC GTG GAG GGC GGC AGT GGG GCC GGG CGC 1175 
Ala Thr Val Asn Gly Thr Val Glu Gly Gly Ser Gly Ala Gly Arg 
360 365 370 

TTG GAC CTG CCC CCA GGT TTC ATG TTC AAG GTA CAG GCC CAG CAC 1220 
Leu Asp Leu Pro Pro Gly Phe Met Phe Lys Val Gin Ala Gin His 
375 380 385 

GAC TAC ACG GCC ACT GAC ACA GAC GAG CTG CAG CTC AAG GCT GGT 1265 
Asp Tyr Thr Ala Thr Asp Thr Asp Glu Leu Gin Leu Lys Ala Gly 
390 395 400 

GAT GTG GTG CTG GTG ATC CCC TTC CAG AAC CCT GAA GAG CAG GAT 1310 
Asp Val Val Leu Val He Pro Phe Gin Asn Pro Glu Glu Gin Asp 
405 410 415 

GAA GGC TGG CTC ATG GGC GTG AAG GAG AGC GAC TGG AAC CAG CAC 1355 
Glu Gly Trp Leu Met Gly Val Lys Glu Ser Asp Trp Asn Gin His 
420 425 430 

AAG AAG CTG GAG AAG TGC CGT GGC GTC TTC CCC GAG AAC TTC ACT 1400 
Lys Lys Leu Glu Lys Cys Arg Gly Val Phe Pro Glu Asn Phe Thr 
435 440 445 

GAG AGG GTC CCA TGACGGCGGG GCCCAGGCAG CCTCCGGGCG TGTGAAGAAC 1452 
Glu Arg Val Pro 
450 



ACCTCCTCCC GAAAAATGTG TGGTTCTTTT TTTTGTTTTG TTTTCGTTTT 1502 
TCATCTTTTG AAGAGCAAAG GGAAATCAAG AGGAGACCCC CAGGCAGAGG 1552 
GGCGTTCTCC CAAAGTTTAG GTCGTTTTCC AAAGAGCCGC GTCCCGGCAA 1602 



GTCCGGCGGA ATTCACCAGT 
GTTTCTGGCG CCCCTGCCTG 
GGCTGGGGGC TGCCGAGCCA 
CCCGGGGAAG GGTCCTCTTT 
CACCAGCCTA GCCTGCTCTG 
ATAAATCTTA GTGTTCAAAA 
ACTCTCAAAA AAACAAGGAA 



GTTCCTGAAG CTGCTGTGTC 
TGCCCGCATG TGTGCCTGGC 
CCATACTTAA CTGAAGCTTC 
TCCTGGCAGC TGCTGTGGGT 
CCCCGCAGAC GGTCTGTGTG 
CAAAATGAAA CAAAAAAAAA 
TTC 



CTCTAGTTGA 1652 

CGCAGGGCGG 1702 

GGCCGCACCA 1752 

GGGGCCCAGA 1802 

CTGTTTGAAA 1852 

AATGATAAAA 1902 



1925 
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